How much does Grok 4.1 Fast Reasoning cost?

Grok 4.1 Fast Reasoning is billed pay-as-you-go at $0.14 per 1M input tokens and $0.35 per 1M output tokens. There is no subscription — you only pay for what you use.

What is the context window of Grok 4.1 Fast Reasoning?

Grok 4.1 Fast Reasoning supports a context window of up to 2M tokens. It can return up to 33K tokens in a single response.

What can Grok 4.1 Fast Reasoning do?

Grok 4.1 Fast Reasoning supports Vision, Tool calling, Reasoning, Prompt caching.

Is Grok 4.1 Fast Reasoning free to use?

Grok 4.1 Fast Reasoning is a paid, pay-as-you-go model — no subscription, you are only charged for usage.

How do I use Grok 4.1 Fast Reasoning via the API?

Grok 4.1 Fast Reasoning is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID grok-4.1-fast-reasoning with your Api.Airforce API key.

Who makes Grok 4.1 Fast Reasoning?

Grok 4.1 Fast Reasoning is xAI's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.

xAIPaidOperational

Grok 4.1 Fast Reasoning

API model name: grok-4.1-fast-reasoning

Grok 4.1 Fast Reasoning is xAI's chat model, served on the Api.Airforce unified API. It has a 2M-token context window. Beyond text, it accepts image as input. Capabilities include Vision, Tool calling, Reasoning, Prompt caching. It is priced at $0.14 per million input tokens and $0.35 per million output tokens. That is below the provider's $0.20 official input rate. Knowledge cutoff: 2025-09. Access it through the OpenAI-compatible API with one key, alongside 100+ other models on Api.Airforce.

Get an API key View pricing

Pricing

Input / 1M tokens

$0.14

Output / 1M tokens

$0.35

Official input rate

$0.20

Official output rate

$0.50

Api.Airforce price vs. the provider's official rate.

Specifications

Provider: xAI
Type: chat model
Context window: 2M tokens
Max output: 33K tokens
Knowledge cutoff: 2025-09
Input: text, image
Output: text
Prompt caching: Supported

Capabilities

VisionTool callingReasoningPrompt cachingStreaming

Benchmarks

Independent evaluations and measured speed from Artificial Analysis.

Intelligence Index

30.6/100

Math Index

89.3/100

MMLU-Pro85%

GPQA Diamond85%

Humanity's Last Exam18%

LiveCodeBench82%

AIME 202589%

Source: Benchmark data by Artificial Analysis (artificialanalysis.ai)

What is Grok 4.1 Fast Reasoning used for?

Chatbots & assistants — conversational AI, drafting, summarizing and Q&A.
Image understanding — analyze photos, screenshots, charts and scanned documents.
Agents & automation — function calling and tool use for multi-step workflows.
Complex reasoning — math, coding and step-by-step problem solving.
Long-context tasks — process entire documents or codebases in a single prompt.
Real-time experiences — stream tokens for responsive chat and apps.