MoonshotPaidOperational

Kimi K2.6

API model name: kimi-k2.6

Kimi K2.6 is Moonshot's chat model, served on the Api.Airforce unified API. It has a 262K-token context window. Beyond text, it accepts image as input. Capabilities include Vision, Tool calling, Reasoning, Prompt caching. It is priced at $0.85 per million input tokens and $3.70 per million output tokens. That is below the provider's $0.95 official input rate. Access it through the OpenAI-compatible API with one key, alongside 65+ other models on Api.Airforce.

Pricing

Input / 1M tokens
$0.85
Output / 1M tokens
$3.70
Official input rate
$0.95

Api.Airforce price vs. the provider's official rate.

Specifications

Provider
Moonshot
Type
chat model
Context window
262K tokens
Max output
262K tokens
Input
text, image
Output
text

Capabilities

VisionTool callingReasoningPrompt cachingStreaming

Use Kimi K2.6 via the API

OpenAI-compatible — point any OpenAI SDK at https://api.airforce/v1 and pass kimi-k2.6 as the model.

cURL
curl https://api.airforce/v1/chat/completions \
  -H "Authorization: Bearer $AIRFORCE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kimi-k2.6",
    "messages": [{ "role": "user", "content": "Hello!" }]
  }'
Python
from openai import OpenAI
client = OpenAI(base_url="https://api.airforce/v1", api_key="$AIRFORCE_API_KEY")
r = client.chat.completions.create(
    model="kimi-k2.6",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(r.choices[0].message.content)
JavaScript
import OpenAI from "openai";
const client = new OpenAI({ baseURL: "https://api.airforce/v1", apiKey: process.env.AIRFORCE_API_KEY });
const r = await client.chat.completions.create({
  model: "kimi-k2.6",
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(r.choices[0].message.content);

Live performance

Real throughput and latency across the suppliers serving this model.

Loading live metrics…

Related models