Models
Browse 2 canonical LLM models across all providers
GigaChat 3.1 Lightning8K ctx
Sber's compact Mixture-of-Experts model with 10B total parameters and 1.8B active. Designed for fast multilingual assistant workloads, reasoning, code, function calling, and product-style deployment on edge devices.
chatcompletionfunction-calling
GigaChat 3.1 Ultra32K ctx
Sber's flagship large-scale Mixture-of-Experts model with 702B total parameters and 36B active. Designed for multilingual assistant workloads, reasoning, code generation, tool use, and large-cluster deployment. Open-weight release.
chatcompletionfunction-calling