Models
Browse 3 canonical LLMs across all providers
GigaChat 3.1 Lightning (8K ctx)
Sber's compact Mixture-of-Experts model with 10B total parameters and 1.8B active. Designed for fast multilingual assistant workloads, reasoning, code, function calling, and product-style deployment on edge devices.
chat, completion, function-calling
GigaChat 3.1 Ultra (32K ctx)
Sber's flagship large-scale Mixture-of-Experts model with 702B total parameters and 36B active. Designed for multilingual assistant workloads, reasoning, code generation, tool use, and large-cluster deployment. Open-weight release.
chat, completion, function-calling
YandexGPT 5 Lite (32K ctx)
Yandex's compact 8B-parameter language model trained on 15T tokens of primarily Russian and English text. Features a 32K context window with strong performance on web, code, and mathematics tasks. Open-weight release.
chat, completion, code-generation