How much does Grok 4.1 Fast Non Reasoning cost?

Grok 4.1 Fast Non Reasoning is billed pay-as-you-go at $0.18 per 1M input tokens and $0.18 per 1M output tokens. There is no subscription — you only pay for what you use.

What is the context window of Grok 4.1 Fast Non Reasoning?

Grok 4.1 Fast Non Reasoning supports a context window of up to 2M tokens. It can return up to 33K tokens in a single response.

What can Grok 4.1 Fast Non Reasoning do?

Grok 4.1 Fast Non Reasoning supports Vision, Tool calling, Prompt caching.

Is Grok 4.1 Fast Non Reasoning free to use?

Grok 4.1 Fast Non Reasoning is a paid, pay-as-you-go model — no subscription, you are only charged for usage.

How do I use Grok 4.1 Fast Non Reasoning via the API?

Grok 4.1 Fast Non Reasoning is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID grok-4.1-fast-non-reasoning with your Api.Airforce API key.

Who makes Grok 4.1 Fast Non Reasoning?

Grok 4.1 Fast Non Reasoning is xAI's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.

xAIPaidOperational

Grok 4.1 Fast Non Reasoning

API model name: grok-4.1-fast-non-reasoning

Grok 4.1 Fast Non Reasoning is xAI's chat model, served on the Api.Airforce unified API. It has a 2M-token context window. Beyond text, it accepts image as input. Capabilities include Vision, Tool calling, Prompt caching. It is priced at $0.18 per million input tokens and $0.18 per million output tokens. That is below the provider's $0.20 official input rate. Knowledge cutoff: 2025-09. Access it through the OpenAI-compatible API with one key, alongside 100+ other models on Api.Airforce.

Get an API key View pricing

Pricing

Input / 1M tokens

$0.18

Output / 1M tokens

$0.18

Official input rate

$0.20

Official output rate

$0.50

Api.Airforce price vs. the provider's official rate.

Specifications

Provider: xAI
Type: chat model
Context window: 2M tokens
Max output: 33K tokens
Knowledge cutoff: 2025-09
Input: text, image
Output: text
Prompt caching: Supported

Capabilities

VisionTool callingPrompt cachingStreaming

Benchmarks

Independent evaluations and measured speed from Artificial Analysis.

Intelligence Index

30.6/100

Math Index

89.3/100

MMLU-Pro85%

GPQA Diamond85%

Humanity's Last Exam18%

LiveCodeBench82%

AIME 202589%

Source: Benchmark data by Artificial Analysis (artificialanalysis.ai)

What is Grok 4.1 Fast Non Reasoning used for?

Chatbots & assistants — conversational AI, drafting, summarizing and Q&A.
Image understanding — analyze photos, screenshots, charts and scanned documents.
Agents & automation — function calling and tool use for multi-step workflows.
Long-context tasks — process entire documents or codebases in a single prompt.
Real-time experiences — stream tokens for responsive chat and apps.