Models
Browse 22 canonical LLMs across all providers
Qwen 3.7 Plus
Alibaba's multimodal variant in the Qwen 3.7 family, optimized for vision understanding and other multimodal tasks. Ranked…
$0.80 – $2.40 / 1M tokens
Qwen 3.7 Max
Alibaba's flagship proprietary model engineered for advanced agentic coding, complex reasoning, and long-horizon task execution. Ranked…
$1.30 – $7.80 / 1M tokens
MiniCPM-V 4.6
Ultra-efficient multimodal language model from OpenBMB built on SigLIP2-400M and Qwen3.5-0.8B (~1B parameters). Supports single-image, multi-image, and video understanding with mixed 4x/16x visual token compression. Designed for edge deployment on iOS, Android, and HarmonyOS.
DeepSeek V4 Pro
DeepSeek's flagship V4 model, a Mixture-of-Experts architecture with 1.6T total parameters (49B activated) and a 1M-token context window. Closes the gap with frontier proprietary models on reasoning and coding benchmarks.
$0.00 – $2.19 / 1M tokens
DeepSeek V4 Flash
DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports a 1M-token context window.
$0.00 – $0.40 / 1M tokens
Qwen 3.6 27B
Alibaba's dense 27B-parameter model that outperforms its own 397B MoE predecessor on agentic coding benchmarks. Offers strong multilingual and reasoning capabilities. Released under Apache 2.0.
$0.20 – $0.60 / 1M tokens
Hy3 Preview
Tencent's flagship open-weight Mixture-of-Experts model from the Hunyuan family with 295B total parameters and 21B active. Integrates fast and slow thinking modes with configurable reasoning effort. Designed for agentic workflows, cross-file code refactoring, long-document analysis, and multi-step tool use.
$0.00 – $0.28 / 1M tokens
MiMo-V2.5-Pro
Xiaomi's flagship 1.02T-parameter Mixture-of-Experts model with 42B active parameters, built on a hybrid-attention architecture with 3-layer Multi-Token Prediction. Designed for complex agentic tasks, software engineering, and long-horizon instruction following with a 1M-token context window.
$1.00 – $3.00 / 1M tokens
Qwen 3.6 35B-A3B
Alibaba's efficient Mixture-of-Experts model with 35B total parameters and 3B active per token. Frontier-level agentic coding performance with 73.4% on SWE-bench Verified and 92.7 on AIME 2026. Released under Apache 2.0.
$0.14 – $0.42 / 1M tokens
Qwen 3.6 Plus
Alibaba's proprietary flagship model in the Qwen 3.6 family, targeting enterprise AI workflows with stronger agentic coding capability, visual coding support, and end-to-end enterprise engineering features.
$0.80 – $2.40 / 1M tokens
MiniMax M2.7
MiniMax's latest large language model with strong multilingual and multimodal capabilities. Competitive pricing with high-quality text generation and improved reasoning performance.
$0.00 – $1.50 / 1M tokens
Kimi K2.6
Moonshot AI's latest model with ultra-long context window support, strong reasoning capabilities, and excellent performance on complex multi-step tasks. Known for reliable long-document understanding.
$0.00 – $3.00 / 1M tokens
Qwen 3.6
Alibaba's latest Qwen model with enhanced reasoning, multilingual capabilities, and improved instruction following. Features strong performance on coding, math, and general knowledge benchmarks.
$0.30 – $0.90 / 1M tokens
GLM-5.1
Zhipu AI's latest bilingual model with strong Chinese and English capabilities. Features improved reasoning, coding, and tool use with competitive performance on academic benchmarks.
$0.00 – $3.00 / 1M tokens
DeepSeek V4
DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.
$0.14 – $1.10 / 1M tokens
GLM-4.7
Zhipu AI's multilingual agentic coding model with strong reasoning, tool use, and UI generation capabilities. Predecessor to GLM-5.1 with competitive performance on coding benchmarks.
$0.00 – $1.50 / 1M tokens
Qwen3 235B
Alibaba's Qwen3 235B mixture-of-experts model delivering frontier-level performance with advanced reasoning, function calling, and code generation capabilities at massive scale.
Qwen3 32B
Alibaba's Qwen3 32B dense language model with strong reasoning and multilingual capabilities, supporting function calling and code generation across diverse tasks.
$0.16 – $2.24 / 1M tokens
Qwen3 Coder
Alibaba's Qwen3 Coder model optimized for software development tasks including code generation, debugging, code review, and technical documentation with strong multilingual programming support.
$0.20 – $0.60 / 1M tokens
QwQ 32B
Alibaba's QwQ 32B reasoning-focused model designed for complex problem solving, mathematical reasoning, and step-by-step logical analysis with strong chain-of-thought capabilities.
DeepSeek R1
DeepSeek's reasoning-focused model, trained with reinforcement learning for complex multi-step problems. Excels at math, science, and coding tasks that require chain-of-thought reasoning.
$0.40 – $7.00 / 1M tokens
DeepSeek V3
DeepSeek's third-generation large language model featuring mixture-of-experts architecture, strong multilingual capabilities, and competitive performance on reasoning and coding benchmarks.
$0.27 – $1.10 / 1M tokens
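The price ranges above are quoted per 1M tokens, so a rough budget for a given workload can be bounded by scaling both ends of the range. The sketch below is illustrative only: the helper names `parse_price_range` and `cost_bounds` are hypothetical, and the listing does not say whether a range reflects input versus output pricing or a spread across providers. Under either reading, the two figures bound the per-token price, so the result is a lower and upper estimate rather than an exact bill.

```python
import re


def parse_price_range(text):
    """Parse a listing string such as '$0.80 – $2.40 / 1M tokens'.

    Returns (low, high) in USD per 1M tokens. Hypothetical helper,
    written for the price format used on this page.
    """
    prices = re.findall(r"\$(\d+(?:\.\d+)?)", text)
    if len(prices) != 2:
        raise ValueError(f"expected two dollar amounts, got: {text!r}")
    low, high = (float(p) for p in prices)
    return low, high


def cost_bounds(text, tokens):
    """Scale a per-1M-token price range to a token count.

    Returns (lowest, highest) estimated cost in USD.
    """
    low, high = parse_price_range(text)
    scale = tokens / 1_000_000
    return low * scale, high * scale


# Example: 250,000 tokens at the Qwen 3.7 Plus listing price.
low, high = cost_bounds("$0.80 – $2.40 / 1M tokens", 250_000)
print(f"${low:.2f} to ${high:.2f}")  # prints "$0.20 to $0.60"
```

A range starting at $0.00 only says that the lower bound is zero; it gives no indication of how many tokens are actually available at that price.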