xAIPaidOperational

Grok 4.1 Fast Reasoning

API model name: grok-4.1-fast-reasoning

Grok 4.1 Fast Reasoning is xAI's chat model, served on the Api.Airforce unified API. It has a 2M-token context window. Beyond text, it accepts image as input. Capabilities include Vision, Tool calling, Reasoning, Prompt caching. It is priced at $0.50 per million input tokens and $0.50 per million output tokens. That is below the provider's $0.20 official input rate. Knowledge cutoff: 2025-09. Access it through the OpenAI-compatible API with one key, alongside 100+ other models on Api.Airforce.

Pricing

Input / 1M tokens
$0.50
Output / 1M tokens
$0.50
Official input rate
$0.20
Official output rate
$0.50

Api.Airforce price vs. the provider's official rate.

Specifications

Provider
xAI
Type
chat model
Context window
2M tokens
Max output
33K tokens
Knowledge cutoff
2025-09
Input
text, image
Output
text
Prompt caching
Supported

Capabilities

VisionTool callingReasoningPrompt cachingStreaming

Benchmarks

Independent evaluations and measured speed from Artificial Analysis.

Intelligence Index
38.6/100
Coding Index
30.9/100
Math Index
89.3/100
MMLU-Pro85%
GPQA Diamond85%
Humanity's Last Exam18%
LiveCodeBench82%
AIME 202589%

Source: Benchmark data by Artificial Analysis (artificialanalysis.ai)

What is Grok 4.1 Fast Reasoning used for?

  • Chatbots & assistants — conversational AI, drafting, summarizing and Q&A.
  • Image understanding — analyze photos, screenshots, charts and scanned documents.
  • Agents & automation — function calling and tool use for multi-step workflows.
  • Complex reasoning — math, coding and step-by-step problem solving.
  • Long-context tasks — process entire documents or codebases in a single prompt.
  • Real-time experiences — stream tokens for responsive chat and apps.

Grok 4.1 Fast Reasoning vs. similar models

ModelIntelligenceContextInput / 1MOutput / 1M
Grok 4.1 Fast Reasoning38.62M$0.50$0.50
Grok 2 Vision$3.12$15.57
Grok 3$0.45$2.25
Grok 3 Deepsearch$0.42$2.13

Prices are Api.Airforce pay-as-you-go rates per 1M tokens. Context is the maximum input length.

Related models

Grok 4.1 Fast Reasoning — frequently asked questions

How much does Grok 4.1 Fast Reasoning cost?
Grok 4.1 Fast Reasoning is billed pay-as-you-go at $0.50 per 1M input tokens and $0.50 per 1M output tokens. There is no subscription — you only pay for what you use.
What is the context window of Grok 4.1 Fast Reasoning?
Grok 4.1 Fast Reasoning supports a context window of up to 2M tokens. It can return up to 33K tokens in a single response.
What can Grok 4.1 Fast Reasoning do?
Grok 4.1 Fast Reasoning supports Vision, Tool calling, Reasoning, Prompt caching.
Is Grok 4.1 Fast Reasoning free to use?
Grok 4.1 Fast Reasoning is a paid, pay-as-you-go model — no subscription, you are only charged for usage.
How do I use Grok 4.1 Fast Reasoning via the API?
Grok 4.1 Fast Reasoning is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID grok-4.1-fast-reasoning with your Api.Airforce API key.
Who makes Grok 4.1 Fast Reasoning?
Grok 4.1 Fast Reasoning is xAI's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.