Models

Browse 27 canonical LLM models across all providers

Sort by

Compare Models

Showing 1–24 of 27 models

GLM-5.2

China2 providers1.0M ctx

Z.ai's (formerly Zhipu AI) flagship open-weight coding model with a 1M-token context window. Mixture-of-Experts architecture with 753B total parameters and ~40B active per request, featuring two cost-balancing reasoning modes. Tops several coding benchmarks while remaining a fraction of the cost of comparable proprietary frontier models. MIT-licensed weights.

$0.60 – $2.00 / 1M tokens

Kimi K2.7 Code

China3 providers1.0M ctx

Moonshot AI's latest open-source, coding-focused model in the Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. A 1-trillion-parameter model that cuts reasoning token usage by roughly 30% versus K2.6 while improving coding and agent performance — +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite for multi-language support. Released under a Modified MIT License and available via Kimi APIs and Hugging Face.

$0.60 – $3.00 / 1M tokens

MiniMax M3

China3 providers1.0M ctx

MiniMax's frontier open-weight model with 1M-token context window, native multimodality (text, image, video), and strong coding capabilities. Built on MiniMax Sparse Attention (MSA) architecture, achieving 59% on SWE-Bench Pro with significantly improved efficiency at long context.

$0.30 – $2.40 / 1M tokens

Ring-2.6-1T

China131K ctx

InclusionAI's (Ant Group) trillion-parameter open-weights reasoning model with 63B active parameters per token. Built for real-world agent workflows with adaptive reasoning-effort modes. Features hybrid linear and MLA attention architecture with MIT license.

$0.00 – $0.00 / 1M tokens

Yi-Lightning

China131K ctx

01.AI's flagship large language model with enhanced Mixture-of-Experts architecture. Ranked 6th on Chatbot Arena with particularly strong results in Chinese, Math, Coding, and Hard Prompts categories. Features advanced expert segmentation and optimized KV-caching.

$0.99 – $0.99 / 1M tokens

Qwen 3.7 Plus

China131K ctx

Alibaba's multimodal variant in the Qwen 3.7 family, optimized for vision understanding and multimodal tasks. Ranked

$0.80 – $2.40 / 1M tokens

Qwen 3.7 Max

China131K ctx

Alibaba's flagship proprietary model engineered for advanced agentic coding, complex reasoning, and long-horizon task execution. Ranked

$1.30 – $7.80 / 1M tokens

MiniCPM-V 4.6

China256K ctx

Ultra-efficient multimodal language model from OpenBMB built on SigLIP2-400M and Qwen3.5-0.8B (~1B parameters). Supports single-image, multi-image, and video understanding with mixed 4x/16x visual token compression. Designed for edge deployment on iOS, Android, and HarmonyOS.

DeepSeek V4 Pro

China2 providers1.0M ctx

DeepSeek's flagship V4 model with 1.6T total parameters (49B activated). MoE architecture supporting 1M token context. Closes the gap with frontier proprietary models on reasoning and coding benchmarks.

$0.00 – $2.19 / 1M tokens

DeepSeek V4 Flash

China3 providers1.0M ctx

DeepSeek's efficient V4 model with 284B total parameters (13B activated). Optimized for speed and cost-efficiency while maintaining strong performance. Supports 1M token context window.

$0.00 – $0.40 / 1M tokens

MiMo-V2.5-Pro

China2 providers1.0M ctx

Xiaomi's flagship 1.02T-parameter Mixture-of-Experts model with 42B active parameters, built on a hybrid-attention architecture with 3-layer Multi-Token Prediction. Designed for complex agentic tasks, software engineering, and long-horizon instruction following with a 1M-token context window.

$1.00 – $3.00 / 1M tokens

Hy3 Preview

China2 providers256K ctx

Tencent's flagship open-weight Mixture-of-Experts model from the Hunyuan family with 295B total parameters and 21B active. Integrates fast and slow thinking modes with configurable reasoning effort. Designed for agentic workflows, cross-file code refactoring, long-document analysis, and multi-step tool use.

$0.00 – $0.28 / 1M tokens

Qwen 3.6 27B

China131K ctx

Alibaba's dense 27B parameter model that outperforms its own 397B MoE predecessor on agentic coding benchmarks. Strong multilingual and reasoning capabilities released under Apache 2.0.

$0.20 – $0.60 / 1M tokens

Qwen 3.6 35B-A3B

China131K ctx

Alibaba's efficient Mixture-of-Experts model with 35B total parameters and 3B active per token. Frontier-level agentic coding performance with 73.4% on SWE-bench Verified and 92.7 on AIME 2026. Released under Apache 2.0.

$0.14 – $0.42 / 1M tokens

Qwen 3.6 Plus

China131K ctx

Alibaba's proprietary flagship model in the Qwen 3.6 family, targeting enterprise AI workflows with stronger agentic coding capability, visual coding support, and end-to-end enterprise engineering features.

$0.80 – $2.40 / 1M tokens

GLM-5.1

China2 providers131K ctx

Zhipu AI's latest bilingual model with strong Chinese and English capabilities. Features improved reasoning, coding, and tool use with competitive performance on academic benchmarks.

$0.00 – $3.00 / 1M tokens

Kimi K2.6

China2 providers1.0M ctx

Moonshot AI's latest model with ultra-long context window support, strong reasoning capabilities, and excellent performance on complex multi-step tasks. Known for reliable long-document understanding.

$0.00 – $3.00 / 1M tokens

MiniMax M2.7

China2 providers200K ctx

MiniMax's latest large language model with strong multilingual and multimodal capabilities. Competitive pricing with high-quality text generation and improved reasoning performance.

$0.00 – $1.50 / 1M tokens

Qwen 3.6

China131K ctx

Alibaba's latest Qwen model with enhanced reasoning, multilingual capabilities, and improved instruction following. Features strong performance on coding, math, and general knowledge benchmarks.

$0.30 – $0.90 / 1M tokens

DeepSeek V4

China6 providers256K ctx

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

$0.14 – $1.10 / 1M tokens

GLM-4.7

China2 providers131K ctx

Zhipu AI's multilingual agentic coding model with strong reasoning, tool use, and UI generation capabilities. Predecessor to GLM-5.1 with competitive performance on coding benchmarks.

$0.00 – $1.50 / 1M tokens

Qwen3 235B

China131K ctx

Alibaba's Qwen3 235B mixture-of-experts model delivering frontier-level performance with advanced reasoning, function calling, and code generation capabilities at massive scale.

Qwen3 32B

China5 providers131K ctx

Alibaba's Qwen3 32B dense language model with strong reasoning and multilingual capabilities, supporting function calling and code generation across diverse tasks.

$0.16 – $2.24 / 1M tokens

Qwen3 Coder

China131K ctx

Alibaba's Qwen3 Coder model optimized for software development tasks including code generation, debugging, code review, and technical documentation with strong multilingual programming support.

$0.20 – $0.60 / 1M tokens

1 2 Next →