How much does Gemini 3.5 Flash cost?

Gemini 3.5 Flash is billed pay-as-you-go at $1.20 per 1M input tokens and $7.20 per 1M output tokens. There is no subscription — you only pay for what you use.

What can Gemini 3.5 Flash do?

Gemini 3.5 Flash supports Vision, Tool calling, Reasoning, Documents.

Is Gemini 3.5 Flash free to use?

Gemini 3.5 Flash is a paid, pay-as-you-go model — no subscription, you are only charged for usage.

How do I use Gemini 3.5 Flash via the API?

Gemini 3.5 Flash is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID gemini-3.5-flash with your Api.Airforce API key.

Who makes Gemini 3.5 Flash?

Gemini 3.5 Flash is Google's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.

GooglePaidOperational

Gemini 3.5 Flash

API model name: gemini-3.5-flash

Gemini 3.5 Flash is Google's chat model, served on the Api.Airforce unified API. Capabilities include Vision, Tool calling, Reasoning, Documents. It is priced at $1.20 per million input tokens and $7.20 per million output tokens. That is below the provider's $1.50 official input rate. Access it through the OpenAI-compatible API with one key, alongside 100+ other models on Api.Airforce.

Get an API key View pricing

Pricing

Input / 1M tokens

$1.20

Output / 1M tokens

$7.20

Cache read / 1M tokens

$0.13

Cache write / 1M tokens

$0.08

Official input rate

$1.50

Official output rate

$9.00

Api.Airforce price vs. the provider's official rate.

Specifications

Provider: Google
Type: chat model

Capabilities

VisionTool callingReasoningDocuments

Benchmarks

Independent evaluations and measured speed from Artificial Analysis.

Intelligence Index

45.4/100

GPQA Diamond92%

Humanity's Last Exam40%

Output speed253.4 tok/s

Time to first token12.83 s

Source: Benchmark data by Artificial Analysis (artificialanalysis.ai)

What is Gemini 3.5 Flash used for?

Chatbots & assistants — conversational AI, drafting, summarizing and Q&A.
Image understanding — analyze photos, screenshots, charts and scanned documents.
Agents & automation — function calling and tool use for multi-step workflows.
Complex reasoning — math, coding and step-by-step problem solving.
Document analysis — summarize and answer questions across long files.

Gemini 3.5 Flash vs. similar models

Model	Intelligence	Context	Input / 1M	Output / 1M
Gemini 3.5 Flash	45.4	—	$1.20	$7.20
Gemini 2.5 Flash	14.1	1M	$0.40	$2.50
Gemini 2.5 Pro	25.8	2M	$0.70	$2.20
Gemini 3 Flash	27.4	1M	$0.26	$1.59

Prices are Api.Airforce pay-as-you-go rates per 1M tokens. Context is the maximum input length.

Gemini 3.5 Flash — frequently asked questions

How much does Gemini 3.5 Flash cost?: Gemini 3.5 Flash is billed pay-as-you-go at $1.20 per 1M input tokens and $7.20 per 1M output tokens. There is no subscription — you only pay for what you use.
What can Gemini 3.5 Flash do?: Gemini 3.5 Flash supports Vision, Tool calling, Reasoning, Documents.
Is Gemini 3.5 Flash free to use?: Gemini 3.5 Flash is a paid, pay-as-you-go model — no subscription, you are only charged for usage.
How do I use Gemini 3.5 Flash via the API?: Gemini 3.5 Flash is OpenAI-compatible. Point any OpenAI SDK at https://api.airforce/v1 and pass the model ID gemini-3.5-flash with your Api.Airforce API key.
Who makes Gemini 3.5 Flash?: Gemini 3.5 Flash is Google's chat model, served through the unified Api.Airforce gateway alongside 100+ other models.

All models·Quickstart·Chat API reference

Use Gemini 3.5 Flash via the API

OpenAI-compatible — point any OpenAI SDK at https://api.airforce/v1 and pass gemini-3.5-flash as the model.

cURL

curl https://api.airforce/v1/chat/completions \
  -H "Authorization: Bearer $AIRFORCE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-3.5-flash",
    "messages": [{ "role": "user", "content": "Hello!" }]
  }'

Python

from openai import OpenAI
client = OpenAI(base_url="https://api.airforce/v1", api_key="$AIRFORCE_API_KEY")
r = client.chat.completions.create(
    model="gemini-3.5-flash",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(r.choices[0].message.content)

JavaScript

import OpenAI from "openai";
const client = new OpenAI({ baseURL: "https://api.airforce/v1", apiKey: process.env.AIRFORCE_API_KEY });
const r = await client.chat.completions.create({
  model: "gemini-3.5-flash",
  messages: [{ role: "user", content: "Hello!" }],
});
console.log(r.choices[0].message.content);