Moonshot AI's latest open-source, coding-focused model in the Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. A 1-trillion-parameter model that cuts reasoning token usage by roughly 30% versus K2.6 while improving coding and agent performance — +21.8% on Kimi Code Bench v2, +11.0% on Program Bench, and +31.5% on MLS Bench Lite for multi-language support. Released under a Modified MIT License and available via Kimi APIs and Hugging Face.
1.0M tokens
3
available
Cheapest
Hugging Face Inference
$3.00/1M tokens
Hugging Face Inference, Moonshot AI, OpenRouter
Sorted by total cost (input + output per 1M tokens). Click a row to view provider details.
| Provider | Pricing (per 1M) | Rate Limits | Regions | Health | Latency |
|---|---|---|---|---|---|
In: $0.60Out: $2.40 | 300 RPM / 500K TPM | us-east-1eu-west-1 | Healthy | 0ms | |
In: $0.60Out: $2.40 | 200 RPM / 500K TPM | us-east-1 | Healthy | 0ms | |
In: $1.00Out: $3.00 | 200 RPM / 500K TPM | us-east-1global | Healthy | 0ms |
Use this model via Hugging Face Inference with an OpenAI-compatible SDK.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.hugging-face.com/v1",
apiKey: process.env.HUGGING_FACE_API_KEY,
});
const response = await client.chat.completions.create({
model: "moonshotai/Kimi-K2.7-Code",
messages: [
{ role: "user", content: "Hello!" }
],
});
console.log(response.choices[0].message.content);Using Hugging Face Inference API • OpenAI-compatible SDK