MoonshotFree

Kimi K2.6 Thinking

API model name: kimi-k2.6-thinking

Kimi K2.6 Thinking is Moonshot's chat model, served on the Api.Airforce unified API. It has a 262K-token context window. Capabilities include Tool calling, Reasoning, Prompt caching. It is available on the free tier at no per-token cost. Knowledge cutoff: 2026-02. Access it through the OpenAI-compatible API with one key, alongside 100+ other models on Api.Airforce.

Pricing

Input / 1M tokens
Free
Output / 1M tokens
Free

Specifications

Provider
Moonshot
Type
chat model
Context window
262K tokens
Max output
16K tokens
Knowledge cutoff
2026-02
Input
text
Output
text
Prompt caching
Supported

Capabilities

Tool callingReasoningPrompt cachingStreaming

Benchmarks

Independent evaluations and measured speed from Artificial Analysis.

Intelligence Index
42.9/100
Coding Index
38.4/100
GPQA Diamond79%
Humanity's Last Exam18%
Output speed32.4 tok/s
Time to first token1.46 s

Source: Benchmark data by Artificial Analysis (artificialanalysis.ai)

What is Kimi K2.6 Thinking used for?

  • Chatbots & assistants — conversational AI, drafting, summarizing and Q&A.
  • Agents & automation — function calling and tool use for multi-step workflows.
  • Complex reasoning — math, coding and step-by-step problem solving.
  • Long-context tasks — process entire documents or codebases in a single prompt.
  • Real-time experiences — stream tokens for responsive chat and apps.

Kimi K2.6 Thinking vs. similar models

ModelIntelligenceContextInput / 1MOutput / 1M
Kimi K2.6 Thinking42.9262KFreeFree
Kimi K226.3131K$0.87$3.42
Kimi K2 0711$0.87$3.42
Kimi K2 0905$2.46$10.20

Prices are Api.Airforce pay-as-you-go rates per 1M tokens. Context is the maximum input length.

Related models

Kimi K2.6 Thinking — frequently asked questions

How much does Kimi K2.6 Thinking cost?
Kimi K2.6 Thinking is available on the free tier at no token cost — create an API key and start building. Paid plans add higher rate limits.
What is the context window of Kimi K2.6 Thinking?
Kimi K2.6 Thinking supports a context window of up to 262K tokens. It can return up to 16K tokens in a single response.
What can Kimi K2.6 Thinking do?
Kimi K2.6 Thinking supports Tool calling, Reasoning, Prompt caching.
Is Kimi K2.6 Thinking free to use?
Yes — Kimi K2.6 Thinking is on the free tier. You can use it with an Api.Airforce API key at no cost.
How do I use Kimi K2.6 Thinking via the API?
Kimi K2.6 Thinking is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID kimi-k2.6-thinking with your Api.Airforce API key.
Who makes Kimi K2.6 Thinking?
Kimi K2.6 Thinking is Moonshot's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.