Cohere's compact 7B-parameter model, optimized for RAG, tool use, and code tasks. It offers a 128K-token context window and is built for fast, efficient inference on commodity GPUs and edge devices.
Context window: 128K tokens. Providers: 1 available. Cheapest: Cohere at $0.19/1M tokens (input + output).
Sorted by total cost (input + output per 1M tokens).

| Provider | Pricing (per 1M tokens) | Rate Limits | Regions | Health | Latency |
|---|---|---|---|---|---|
| Cohere | In: $0.04, Out: $0.15 | 500 RPM / 1.0M TPM | us-east-1, eu-west-1, global | Healthy | 0ms |
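
To make the pricing and rate-limit figures concrete, here is a minimal sketch of the arithmetic behind them. The helper names `estimateCostUSD` and `maxRequestsPerMinute` are hypothetical, and the sketch assumes that the TPM limit counts input and output tokens together, which this page does not state.

```javascript
// Per-1M-token prices (USD) and rate limits copied from the provider table.
const PRICE_PER_MILLION = { input: 0.04, output: 0.15 };
const LIMITS = { rpm: 500, tpm: 1_000_000 };

// Hypothetical helper: estimated cost of one request in USD.
function estimateCostUSD(inputTokens, outputTokens, price = PRICE_PER_MILLION) {
  return (inputTokens * price.input + outputTokens * price.output) / 1_000_000;
}

// Hypothetical helper: sustainable requests per minute for a given average
// request size. Assumption: TPM counts input and output tokens together.
function maxRequestsPerMinute(tokensPerRequest, limits = LIMITS) {
  return Math.min(limits.rpm, Math.floor(limits.tpm / tokensPerRequest));
}

// The "total cost" used for sorting: 1M input tokens plus 1M output tokens.
console.log(estimateCostUSD(1_000_000, 1_000_000).toFixed(2)); // "0.19"

// At 10,000 tokens per request, the token limit binds before the request limit.
console.log(maxRequestsPerMinute(10_000)); // 100
```

Under these assumptions, requests averaging more than 2,000 tokens are bounded by the TPM limit rather than the RPM limit.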
Use this model via Cohere with an OpenAI-compatible SDK.
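```javascript
import OpenAI from "openai";

// Point the OpenAI SDK at Cohere's OpenAI-compatible (Compatibility API) endpoint.
const client = new OpenAI({
  baseURL: "https://api.cohere.ai/compatibility/v1",
  apiKey: process.env.COHERE_API_KEY, // read the key from the environment
});

// Send a single-turn chat request to Command R7B.
const response = await client.chat.completions.create({
  model: "command-r7b-12-2024",
  messages: [
    { role: "user", content: "Hello!" }
  ],
});

// Print the assistant's reply.
console.log(response.choices[0].message.content);
```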
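
Because this provider lists rate limits (500 RPM / 1.0M TPM), a caller may want to back off when a request is rejected for exceeding them. The following is a rough sketch only: `withBackoff` is a hypothetical helper, not part of any SDK, and it assumes rate-limit errors expose the HTTP status as `err.status`. The OpenAI SDK also has its own `maxRetries` client option, which may already be enough for simple cases.

```javascript
// Hypothetical helper (not part of any SDK): retry an async call when it
// fails with HTTP 429, doubling the wait after each failed attempt.
async function withBackoff(call, { retries = 3, baseDelayMs = 500 } = {}) {
  for (let attempt = 0; ; attempt++) {
    try {
      return await call();
    } catch (err) {
      // Assumption: rate-limit errors carry the HTTP status in `err.status`.
      const rateLimited = Boolean(err) && err.status === 429;
      // Rethrow anything that is not a rate limit, or once retries run out.
      if (!rateLimited || attempt >= retries) throw err;
      const delayMs = baseDelayMs * 2 ** attempt; // 500, 1000, 2000, ...
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
}
```

It would wrap the request as `await withBackoff(() => client.chat.completions.create({ ... }))`, leaving the request itself unchanged.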