xAIPaidOperational

Grok 4.1 Fast Non Reasoning

API model name: grok-4.1-fast-non-reasoning

Grok 4.1 Fast Non Reasoning is xAI's chat model, served on the Api.Airforce unified API. It has a 2M-token context window. Beyond text, it accepts image as input. Capabilities include Vision, Tool calling, Prompt caching. It is priced at $0.18 per million input tokens and $0.18 per million output tokens. That is below the provider's $0.20 official input rate. Knowledge cutoff: 2025-09. Access it through the OpenAI-compatible API with one key, alongside 100+ other models on Api.Airforce.

Pricing

Input / 1M tokens
$0.18
Output / 1M tokens
$0.18
Official input rate
$0.20
Official output rate
$0.50

Api.Airforce price vs. the provider's official rate.

Specifications

Provider
xAI
Type
chat model
Context window
2M tokens
Max output
33K tokens
Knowledge cutoff
2025-09
Input
text, image
Output
text
Prompt caching
Supported

Capabilities

VisionTool callingPrompt cachingStreaming

Benchmarks

Independent evaluations and measured speed from Artificial Analysis.

Intelligence Index
38.6/100
Coding Index
30.9/100
Math Index
89.3/100
MMLU-Pro85%
GPQA Diamond85%
Humanity's Last Exam18%
LiveCodeBench82%
AIME 202589%

Source: Benchmark data by Artificial Analysis (artificialanalysis.ai)

What is Grok 4.1 Fast Non Reasoning used for?

  • Chatbots & assistants — conversational AI, drafting, summarizing and Q&A.
  • Image understanding — analyze photos, screenshots, charts and scanned documents.
  • Agents & automation — function calling and tool use for multi-step workflows.
  • Long-context tasks — process entire documents or codebases in a single prompt.
  • Real-time experiences — stream tokens for responsive chat and apps.

Grok 4.1 Fast Non Reasoning vs. similar models

ModelIntelligenceContextInput / 1MOutput / 1M
Grok 4.1 Fast Non Reasoning38.62M$0.18$0.18
Grok 2 Vision$3.12$15.57
Grok 3$0.45$2.25
Grok 3 Deepsearch$0.42$2.13

Prices are Api.Airforce pay-as-you-go rates per 1M tokens. Context is the maximum input length.

Related models

Grok 4.1 Fast Non Reasoning — frequently asked questions

How much does Grok 4.1 Fast Non Reasoning cost?
Grok 4.1 Fast Non Reasoning is billed pay-as-you-go at $0.18 per 1M input tokens and $0.18 per 1M output tokens. There is no subscription — you only pay for what you use.
What is the context window of Grok 4.1 Fast Non Reasoning?
Grok 4.1 Fast Non Reasoning supports a context window of up to 2M tokens. It can return up to 33K tokens in a single response.
What can Grok 4.1 Fast Non Reasoning do?
Grok 4.1 Fast Non Reasoning supports Vision, Tool calling, Prompt caching.
Is Grok 4.1 Fast Non Reasoning free to use?
Grok 4.1 Fast Non Reasoning is a paid, pay-as-you-go model — no subscription, you are only charged for usage.
How do I use Grok 4.1 Fast Non Reasoning via the API?
Grok 4.1 Fast Non Reasoning is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID grok-4.1-fast-non-reasoning with your Api.Airforce API key.
Who makes Grok 4.1 Fast Non Reasoning?
Grok 4.1 Fast Non Reasoning is xAI's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.