Yandex's compact 8B parameter language model trained on 15T tokens of primarily Russian and English text. Features 32K context window with strong performance on web, code, and mathematics tasks. Open-weight release.
32K tokens
1
available
Cheapest
Yandex Cloud
$0.60/1M tokens
Yandex Cloud
Sorted by total cost (input + output per 1M tokens). Click a row to view provider details.
| Provider | Pricing (per 1M) | Rate Limits | Regions | Health | Latency |
|---|---|---|---|---|---|
In: $0.20Out: $0.40 | 100 RPM / 200K TPM | ru-central-1 | Healthy | 0ms |
Use this model via Yandex Cloud with an OpenAI-compatible SDK.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.yandex-cloud.com/v1",
apiKey: process.env.YANDEX_CLOUD_API_KEY,
});
const response = await client.chat.completions.create({
model: "yandexgpt-lite/latest",
messages: [
{ role: "user", content: "Hello!" }
],
});
console.log(response.choices[0].message.content);Using Yandex Cloud API • OpenAI-compatible SDK