Xiaomi · Paid · Operational

Mimo V2.5 Pro

API model name: mimo-v2.5-pro

Mimo V2.5 Pro is Xiaomi's chat model, served through the Api.Airforce unified API. It has a 1M-token context window and supports tool calling, reasoning, prompt caching, and streaming. It is priced at $0.80 per million input tokens and $2.40 per million output tokens; the input price is 20% below the provider's official input rate of $1.00. Access it through the OpenAI-compatible API with a single key, alongside 65+ other models on Api.Airforce.

Pricing

Input / 1M tokens: $0.80
Output / 1M tokens: $2.40
Official input rate: $1.00

Api.Airforce price vs. the provider's official rate.
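
To illustrate how the per-million rates translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost_usd` and the example token counts are hypothetical; the rates are the ones listed above. It ignores any prompt-caching discount, which this page does not quantify.

```python
from decimal import Decimal

# Rates listed on this page, in USD per 1M tokens.
INPUT_RATE = Decimal("0.80")
OUTPUT_RATE = Decimal("2.40")
OFFICIAL_INPUT_RATE = Decimal("1.00")
MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper, no caching discount)."""
    total = Decimal(input_tokens) * INPUT_RATE + Decimal(output_tokens) * OUTPUT_RATE
    return total / MILLION


# Example: a 200,000-token prompt with a 4,000-token reply (made-up numbers).
cost = estimate_cost_usd(200_000, 4_000)

# Fraction by which the listed input price undercuts the official input rate.
input_discount = (OFFICIAL_INPUT_RATE - INPUT_RATE) / OFFICIAL_INPUT_RATE

print(f"estimated cost: ${cost}")
print(f"input discount: {input_discount:.0%}")
```

Decimal arithmetic is used so that small per-token amounts do not pick up binary floating-point rounding noise.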

Specifications

Provider: Xiaomi
Type: chat model
Context window: 1M tokens
Max output: 16K tokens
Input: text
Output: text
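
The context window and max output limits suggest a simple check before sending a request. The sketch below assumes that "1M" means 1,000,000 tokens and "16K" means 16,000 tokens (the page does not say whether binary multiples are meant), and that output tokens count against the context window. The helper name `clamp_max_tokens` is hypothetical, and the prompt token count is assumed to come from the caller's own estimate.

```python
CONTEXT_WINDOW = 1_000_000  # assumption: "1M tokens" read as 1,000,000
MAX_OUTPUT = 16_000         # assumption: "16K tokens" read as 16,000


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return a max_tokens value that respects both limits (hypothetical helper)."""
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Take the tightest of: what was asked for, the output cap, the space left.
    return max(0, min(requested, MAX_OUTPUT, remaining))


print(clamp_max_tokens(10_000, 50_000))   # limited by the output cap
print(clamp_max_tokens(995_000, 16_000))  # limited by the remaining context
```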

Capabilities

Tool calling, Reasoning, Prompt caching, Streaming

Use Mimo V2.5 Pro via the API

OpenAI-compatible — point any OpenAI SDK at https://api.airforce/v1 and pass mimo-v2.5-pro as the model.

cURL
curl https://api.airforce/v1/chat/completions \
  -H "Authorization: Bearer $AIRFORCE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "mimo-v2.5-pro",
    "messages": [{ "role": "user", "content": "Hello!" }]
  }'
Python
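import os
from openai import OpenAI
# Read the key from the environment; Python does not expand a literal "$AIRFORCE_API_KEY" string.
client = OpenAI(base_url="https://api.airforce/v1", api_key=os.environ["AIRFORCE_API_KEY"])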
r = client.chat.completions.create(
    model="mimo-v2.5-pro",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(r.choices[0].message.content)
JavaScript
import OpenAI from "openai";
const client = new OpenAI({ baseURL: "https://api.airforce/v1", apiKey: process.env.AIRFORCE_API_KEY });
const r = await client.chat.completions.create({
  model: "mimo-v2.5-pro",
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(r.choices[0].message.content);
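
Tool calling and streaming are listed among the capabilities, but the examples above only send a plain message. The sketch below builds a request body that uses both, assuming the endpoint accepts the standard OpenAI chat-completions `tools` and `stream` fields unchanged. The `get_weather` tool and the helper name `build_tool_request` are hypothetical, and nothing is sent over the network here.

```python
import json


def build_tool_request(question: str) -> dict:
    """Build a chat-completions body with one tool and streaming enabled."""
    return {
        "model": "mimo-v2.5-pro",
        "messages": [{"role": "user", "content": question}],
        "stream": True,  # ask for incremental chunks instead of one response
        "tools": [
            {
                "type": "function",
                "function": {
                    # Hypothetical tool, defined only for this illustration.
                    "name": "get_weather",
                    "description": "Look up the current weather for a city.",
                    "parameters": {
                        "type": "object",
                        "properties": {"city": {"type": "string"}},
                        "required": ["city"],
                    },
                },
            }
        ],
    }


payload = build_tool_request("What is the weather in Beijing?")
print(json.dumps(payload, indent=2))
```

With the OpenAI Python SDK, the same fields can be passed as keyword arguments, for example `client.chat.completions.create(**payload)`; with `stream=True` the SDK returns an iterator of chunks rather than a single response object.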

Related models