Models

Browse 24 canonical LLM models across all providers

Showing 1–24 of 24 models

GLM-5.21.0M ctx

Z.ai's (formerly Zhipu AI) flagship open-weight coding model with a 1M-token context window. Mixture-of-Experts architecture with 753B total parameters and ~40B active per request, featuring two cost-balancing reasoning modes. Tops several coding benchmarks while remaining a fraction of the cost of comparable proprietary frontier models. MIT-licensed weights.

chatcompletionfunction-calling

Kimi K2.7 Code1.0M ctx

Moonshot AI's latest open-source, coding-focused model in the Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. A 1-trillion-parameter model that cuts reasoning token usage by roughly 30% versus K2.6 while improving coding and agent performance — +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite for multi-language support. Released under a Modified MIT License and available via Kimi APIs and Hugging Face.

chatcompletionfunction-calling

MiniMax M31.0M ctx

MiniMax's frontier open-weight model with 1M-token context window, native multimodality (text, image, video), and strong coding capabilities. Built on MiniMax Sparse Attention (MSA) architecture, achieving 59% on SWE-Bench Pro with significantly improved efficiency at long context.

chatcompletionfunction-calling

Ring-2.6-1T131K ctx

InclusionAI's (Ant Group) trillion-parameter open-weights reasoning model with 63B active parameters per token. Built for real-world agent workflows with adaptive reasoning-effort modes. Features hybrid linear and MLA attention architecture with MIT license.

chatcompletionfunction-calling

Yi-Lightning131K ctx

01.AI's flagship large language model with enhanced Mixture-of-Experts architecture. Ranked 6th on Chatbot Arena with particularly strong results in Chinese, Math, Coding, and Hard Prompts categories. Features advanced expert segmentation and optimized KV-caching.

chatcompletioncode-generation

Qwen 3.7 Max131K ctx

Alibaba's flagship proprietary model engineered for advanced agentic coding, complex reasoning, and long-horizon task execution. Ranked

chatcompletionfunction-calling

Qwen 3.7 Plus131K ctx

Alibaba's multimodal variant in the Qwen 3.7 family, optimized for vision understanding and multimodal tasks. Ranked

chatcompletionfunction-calling

DeepSeek V4 Pro1.0M ctx

DeepSeek's flagship V4 model with 1.6T total parameters (49B activated). MoE architecture supporting 1M token context. Closes the gap with frontier proprietary models on reasoning and coding benchmarks.

chatcompletionfunction-calling

MiMo-V2.5-Pro1.0M ctx

Xiaomi's flagship 1.02T-parameter Mixture-of-Experts model with 42B active parameters, built on a hybrid-attention architecture with 3-layer Multi-Token Prediction. Designed for complex agentic tasks, software engineering, and long-horizon instruction following with a 1M-token context window.

chatcompletionfunction-calling

Qwen 3.6 27B131K ctx

Alibaba's dense 27B parameter model that outperforms its own 397B MoE predecessor on agentic coding benchmarks. Strong multilingual and reasoning capabilities released under Apache 2.0.

chatcompletionfunction-calling

Hy3 Preview256K ctx

Tencent's flagship open-weight Mixture-of-Experts model from the Hunyuan family with 295B total parameters and 21B active. Integrates fast and slow thinking modes with configurable reasoning effort. Designed for agentic workflows, cross-file code refactoring, long-document analysis, and multi-step tool use.

chatcompletionfunction-calling

Qwen 3.6 35B-A3B131K ctx

Alibaba's efficient Mixture-of-Experts model with 35B total parameters and 3B active per token. Frontier-level agentic coding performance with 73.4% on SWE-bench Verified and 92.7 on AIME 2026. Released under Apache 2.0.

chatcompletionfunction-calling

Qwen 3.6 Plus131K ctx

Alibaba's proprietary flagship model in the Qwen 3.6 family, targeting enterprise AI workflows with stronger agentic coding capability, visual coding support, and end-to-end enterprise engineering features.

chatcompletionfunction-calling

GLM-5.1131K ctx

Zhipu AI's latest bilingual model with strong Chinese and English capabilities. Features improved reasoning, coding, and tool use with competitive performance on academic benchmarks.

chatcompletionfunction-calling

Kimi K2.61.0M ctx

Moonshot AI's latest model with ultra-long context window support, strong reasoning capabilities, and excellent performance on complex multi-step tasks. Known for reliable long-document understanding.

chatcompletionfunction-calling

MiniMax M2.7200K ctx

MiniMax's latest large language model with strong multilingual and multimodal capabilities. Competitive pricing with high-quality text generation and improved reasoning performance.

chatcompletionfunction-calling

Qwen 3.6131K ctx

Alibaba's latest Qwen model with enhanced reasoning, multilingual capabilities, and improved instruction following. Features strong performance on coding, math, and general knowledge benchmarks.

chatcompletionfunction-calling

DeepSeek V4256K ctx

DeepSeek's fourth-generation model with improved mixture-of-experts architecture, enhanced reasoning and coding capabilities, and stronger multilingual performance. Competitive with frontier proprietary models.

chatcompletionfunction-calling

GLM-4.7131K ctx

Zhipu AI's multilingual agentic coding model with strong reasoning, tool use, and UI generation capabilities. Predecessor to GLM-5.1 with competitive performance on coding benchmarks.

chatcompletionfunction-calling

Qwen3 235B131K ctx

Alibaba's Qwen3 235B mixture-of-experts model delivering frontier-level performance with advanced reasoning, function calling, and code generation capabilities at massive scale.

chatcompletionfunction-calling

Qwen3 32B131K ctx

Alibaba's Qwen3 32B dense language model with strong reasoning and multilingual capabilities, supporting function calling and code generation across diverse tasks.

chatcompletionfunction-calling

QwQ 32B131K ctx

Alibaba's QwQ 32B reasoning-focused model designed for complex problem solving, mathematical reasoning, and step-by-step logical analysis with strong chain-of-thought capabilities.

chatcompletionreasoning

DeepSeek R1131K ctx

DeepSeek's reasoning-focused model trained with reinforcement learning for complex multi-step reasoning. Excels at math, science, and coding problems requiring chain-of-thought reasoning.

chatcompletioncode-generation

DeepSeek V3128K ctx

DeepSeek's third-generation large language model featuring mixture-of-experts architecture, strong multilingual capabilities, and competitive performance on reasoning and coding benchmarks.

chatcompletionfunction-calling