A large language model developed by ISSAI (Nazarbayev University), customized from Llama 3.1 70B to give more helpful responses in the Kazakh language. It is part of Kazakhstan's initiative to ensure the country benefits from advances in generative AI.
Context window: 128K tokens
Providers: 1 available
Cheapest: Hugging Face Inference, $0.00/1M tokens
Providers are sorted by total cost (input price plus output price, per 1M tokens).
| Provider | Pricing (per 1M) | Rate Limits | Regions | Health | Latency |
|---|---|---|---|---|---|
| Hugging Face Inference | In: Free / Out: Free | 60 RPM / 300K TPM | us-east-1, eu-west-1 | Healthy | 0 ms |
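The rate limits in the table (60 requests per minute, 300K tokens per minute) can be respected on the client side before a request is ever sent. The following is a minimal sketch, not part of any SDK: `createRateLimiter` and `tryAcquire` are hypothetical names, the token count passed in is whatever estimate the caller has, and the sliding 60-second window is an assumption about how the provider measures its limits.

```javascript
// Hypothetical helper: a sliding-window limiter that tracks both the number
// of requests and the number of tokens admitted during the last `windowMs`.
// The defaults mirror the limits listed in the table (60 RPM / 300K TPM).
function createRateLimiter({
  maxRequests = 60,
  maxTokens = 300000,
  windowMs = 60000,
  now = Date.now, // injectable clock, useful for testing
} = {}) {
  let events = []; // one { time, tokens } entry per admitted request

  return {
    // Returns true and records the request if it fits within both budgets.
    // Returns false and records nothing if the caller should wait.
    tryAcquire(tokens) {
      const cutoff = now() - windowMs;
      events = events.filter((e) => e.time > cutoff); // drop expired entries
      const usedTokens = events.reduce((sum, e) => sum + e.tokens, 0);
      if (events.length >= maxRequests || usedTokens + tokens > maxTokens) {
        return false;
      }
      events.push({ time: now(), tokens });
      return true;
    },
  };
}
```

A caller would check `limiter.tryAcquire(estimatedTokens)` before each request and delay the request when it returns `false`. Tracking requests and tokens together matters because either budget can run out first: many short prompts hit the request limit, while a few long ones hit the token limit.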
Use this model via Hugging Face Inference with an OpenAI-compatible SDK.
import OpenAI from "openai";

// Base URL and environment variable name are as given on this listing;
// confirm both against the provider's documentation before use.
const client = new OpenAI({
  baseURL: "https://api.hugging-face.com/v1",
  apiKey: process.env.HUGGING_FACE_API_KEY,
});

// Send a single-turn chat request to the model.
const response = await client.chat.completions.create({
  model: "issai/LLama-3.1-KazLLM-1.0-70B",
  messages: [
    { role: "user", content: "Hello!" }
  ],
});

// Print the assistant's reply.
console.log(response.choices[0].message.content);
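Because the provider enforces rate limits, a request can be rejected with HTTP 429 even when the code is correct. The sketch below shows one common way to handle that, retrying with exponential backoff. `backoffDelayMs` and `withRetry` are hypothetical helpers, and the code assumes the thrown error exposes a numeric `status` property. The `openai` package also has its own `maxRetries` client option; this sketch only makes the retry policy explicit.

```javascript
// Exponential backoff delay: baseMs, 2 * baseMs, 4 * baseMs, ... capped at maxMs.
function backoffDelayMs(attempt, baseMs = 1000, maxMs = 30000) {
  return Math.min(maxMs, baseMs * 2 ** attempt);
}

// Calls `fn` and retries it when the thrown error has status 429.
// Assumption: rate-limit errors carry a numeric `status` field.
// Any other error, or a 429 on the final attempt, is rethrown to the caller.
async function withRetry(
  fn,
  {
    maxAttempts = 4,
    sleep = (ms) => new Promise((resolve) => setTimeout(resolve, ms)),
  } = {}
) {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      const retryable = Boolean(err) && err.status === 429;
      if (!retryable || attempt + 1 >= maxAttempts) throw err;
      await sleep(backoffDelayMs(attempt));
    }
  }
}
```

With the client from the snippet above, the call would be wrapped as `await withRetry(() => client.chat.completions.create({ ... }))`. With the default settings the waits between attempts are 1, 2, and 4 seconds.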