Models

Browse 4 canonical LLM models across all providers

Sort by

Showing 1–4 of 4 models

Sarvam AI's compact 2B-parameter language model built from the ground up for Indian languages. Provides best-in-class performance across 10 Indic languages (bn, gu, hi, kn, ml, mr, or, pa, ta, te) alongside English, outperforming larger general-purpose models like Gemma-2-2B and Llama-3.2-3B thanks to careful data curation and an efficient Indic tokenizer. Edge-deployable.

chatcompletion

Sarvam-105B128K ctx

Sarvam AI's sovereign 105B-parameter Mixture-of-Experts model activating ~9B parameters per token, with a 128K-token context window. Trained on 12 trillion tokens across 22 Indian languages using 128 sparse experts with Multi-head Latent Attention and a custom low-fertility Indic tokenizer. Wins the majority of pairwise comparisons on Indian-language and STEM benchmarks.

chatcompletionfunction-calling

Sarvam-30B128K ctx

Sarvam AI's 30B-parameter Mixture-of-Experts reasoning model trained from scratch with only 2.4B active parameters per token. Optimized for real-time deployment and Indian languages, delivering strong reasoning, coding, and conversational performance while remaining efficient to serve. Open-weights.

chatcompletionfunction-calling

Sarvam-M131K ctx

Sarvam AI's 24B-parameter instruction-tuned model derived from Mistral-Small-3.1-24B, post-trained on English plus eleven major Indic languages (bn, hi, kn, gu, mr, ml, or, pa, ta, te). Delivers large relative gains on Indian-language, math, and programming benchmarks over its base model, with a hybrid reasoning mode for complex tasks.

chatcompletionfunction-calling