USDC-backed inference on Base for autonomous agents

Agent AI Gateway

A drop-in, OpenAI-compatible gateway that lets agents pay with USDC on Base for long-running inference. Built for OpenClaw, OpenCode, Claude Code, and any SDK that already speaks OpenAI.

OpenAI-compatible+5% price transparencyx402 funding flowLLM-friendly docs
Get startedllm.txt
Fetch skills for your agent:
curl -fsSL https://agentaigateway.com/skill.md

Why agents choose this gateway

Everything is optimized for autonomous tooling: predictable headers, explicit pricing, and a clean OpenAI-compatible surface area.

Wallet-first billing

Agents attach a wallet once, then spend pre-funded credits. No per-request signing, no brittle payment steps, no human-in-the-loop.

Transparent pricing

We mirror Vercel AI Gateway pricing and apply a visible 5% markup. The models table below is exactly what your agent is billed.

Usage telemetry

Global per-model token counters keep capacity planning honest. Headers tell agents their remaining balance after every call.

Agent setup

Designed for scripts, SDKs, and autonomous frameworks. No UI required.

1. Install gateway skills

Use the bundled skills installer when it ships. Placeholder below.

curl -fsSL https://agentaigateway.com/skill.md

2. Configure SDK base URL

Point your OpenAI-compatible client to the gateway.

export OPENAI_BASE_URL="https://api.agentaigateway.com/v1"

3. Fund via x402 deposit

Call POST /v1/deposit to get x402 payment requirements, then sign with your wallet.

POST /v1/deposit {"amount_usdc": 20}
→ 402 + X-Payment-Required header
→ Sign & resend with X-Payment header

4. Attach wallet context

Every request includes your wallet header for credit accounting.

-H "X-Wallet-Address: 0xYOUR_WALLET"

5. Read low-balance hints

Agents can react to headers when credits run low.

X-Low-Balance: true
Check balance
Model pricing (gateway billed)
Auto-refreshed daily from Vercel AI Gateway + 5% markup
251 models
Tags
alibaba/qwen-3-14b
alibaba40,960$0.12 / 1M$0.25 / 1Mreasoning, tool-use
alibaba/qwen-3-235b
alibaba40,960$0.07 / 1M$0.49 / 1Mtool-use
alibaba/qwen-3-30b
alibaba40,960$0.08 / 1M$0.3 / 1Mreasoning, tool-use
alibaba/qwen-3-32b
alibaba131,072$0.3 / 1M$0.61 / 1Mreasoning, tool-use
alibaba/qwen3-235b-a22b-thinking
alibaba262,114$0.24 / 1M$2.4 / 1Mtool-use, vision, file-input
alibaba/qwen3-coder
alibaba262,144$0.4 / 1M$1.6 / 1Mtool-use
alibaba/qwen3-coder-30b-a3b
alibaba262,144$0.15 / 1M$0.6 / 1Mreasoning, tool-use
alibaba/qwen3-coder-next
alibaba256,000$0.5 / 1M$1.2 / 1Mtool-use
alibaba/qwen3-coder-plus
alibaba1,000,000$1 / 1M$5 / 1Mtool-use
alibaba/qwen3-embedding-0.6b
alibaba32,768$0.01 / 1M-
alibaba/qwen3-embedding-4b
alibaba32,768$0.02 / 1M-
alibaba/qwen3-embedding-8b
alibaba32,768$0.05 / 1M-
alibaba/qwen3-max
alibaba262,144$1.2 / 1M$6 / 1Mtool-use, implicit-caching
alibaba/qwen3-max-preview
alibaba262,144$1.2 / 1M$6 / 1Mtool-use, implicit-caching
alibaba/qwen3-max-thinking
alibaba256,000$1.2 / 1M$6 / 1Mreasoning, tool-use, implicit-caching
alibaba/qwen3-next-80b-a3b-instruct
alibaba262,144$0.09 / 1M$1.1 / 1M
alibaba/qwen3-next-80b-a3b-thinking
alibaba131,072$0.15 / 1M$1.2 / 1M-
alibaba/qwen3-vl-instruct
alibaba262,144$0.2 / 1M$0.92 / 1Mvision, file-input
alibaba/qwen3-vl-thinking
alibaba256,000$0.23 / 1M$0.92 / 1Mvision, reasoning, tool-use
alibaba/qwen3.5-flash
alibaba1,000,000$0.1 / 1M$0.4 / 1Mvision, file-input, reasoning
alibaba/qwen3.5-plus
alibaba1,000,000$0.4 / 1M$2.5 / 1Mvision, file-input, reasoning
alibaba/wan-v2.5-t2v-preview
alibaba---
alibaba/wan-v2.6-i2v
alibaba---
alibaba/wan-v2.6-i2v-flash
alibaba---
alibaba/wan-v2.6-r2v
alibaba---
alibaba/wan-v2.6-r2v-flash
alibaba---
alibaba/wan-v2.6-t2v
alibaba---
amazon/nova-2-lite
amazon1,000,000$0.3 / 1M$2.6 / 1Mreasoning, vision
amazon/nova-lite
amazon300,000$0.06 / 1M$0.25 / 1M-
amazon/nova-micro
amazon128,000$0.04 / 1M$0.14 / 1M-
amazon/nova-pro
amazon300,000$0.8 / 1M$3.3 / 1M-
amazon/titan-embed-text-v2
amazon-$0.02 / 1M--
anthropic/claude-3-haiku
anthropic200,000$0.26 / 1M$1.31 / 1Mtool-use, vision
anthropic/claude-3-opus
anthropic200,000$15 / 1M$78 / 1M
anthropic/claude-3.5-haiku
anthropic200,000$0.8 / 1M$4 / 1Mfile-input, tool-use, vision
anthropic/claude-3.5-sonnet
anthropic200,000$3 / 1M$15 / 1Mfile-input, tool-use, vision
anthropic/claude-3.5-sonnet-20240620
anthropic200,000$3 / 1M$15 / 1Mfile-input, tool-use, vision
anthropic/claude-3.7-sonnet
anthropic200,000$3 / 1M$15 / 1Mfile-input, reasoning, tool-use
anthropic/claude-haiku-4.5
anthropic200,000$1 / 1M$5 / 1Mfile-input, reasoning, tool-use
anthropic/claude-opus-4
anthropic200,000$15 / 1M$78 / 1Mfile-input, reasoning, tool-use
anthropic/claude-opus-4.1
anthropic200,000$15 / 1M$78 / 1Mfile-input, reasoning, tool-use
anthropic/claude-opus-4.5
anthropic200,000$5 / 1M$26 / 1Mtool-use, reasoning, vision
anthropic/claude-opus-4.6
anthropic1,000,000$5 / 1M$26 / 1Mtool-use, reasoning, vision
anthropic/claude-sonnet-4
anthropic1,000,000$3 / 1M$15 / 1Mfile-input, reasoning, tool-use
anthropic/claude-sonnet-4.5
anthropic1,000,000$3 / 1M$15 / 1Mfile-input, reasoning, tool-use
anthropic/claude-sonnet-4.6
anthropic1,000,000$3 / 1M$15 / 1Mfile-input, reasoning, tool-use
arcee-ai/trinity-large-preview
arcee-ai131,000$0.26 / 1M$1 / 1Mtool-use
arcee-ai/trinity-mini
arcee-ai131,072$0.05 / 1M$0.15 / 1M
bfl/flux-2-flex
bfl---image-generation
bfl/flux-2-klein-4b
bfl---image-generation
bfl/flux-2-klein-9b
bfl---image-generation
bfl/flux-2-max
bfl67,300--image-generation
bfl/flux-2-pro
bfl67,300--image-generation
bfl/flux-kontext-max
bfl512--image-generation
bfl/flux-kontext-pro
bfl512--image-generation
bfl/flux-pro-1.0-fill
bfl---image-generation
bfl/flux-pro-1.1
bfl---image-generation
bfl/flux-pro-1.1-ultra
bfl---image-generation
bytedance/seed-1.6
bytedance256,000$0.26 / 1M$2 / 1Mreasoning, tool-use, implicit-caching
bytedance/seed-1.8
bytedance256,000$0.26 / 1M$2 / 1Mreasoning, vision, implicit-caching
bytedance/seedance-v1.0-lite-i2v
bytedance---
bytedance/seedance-v1.0-lite-t2v
bytedance---
bytedance/seedance-v1.0-pro
bytedance---
bytedance/seedance-v1.0-pro-fast
bytedance---
bytedance/seedance-v1.5-pro
bytedance---
cohere/command-a
cohere256,000$2.6 / 1M$10 / 1Mtool-use
cohere/embed-v4.0
cohere-$0.12 / 1M--
deepseek/deepseek-r1
deepseek128,000$1.41 / 1M$5.6 / 1Mreasoning, tool-use
deepseek/deepseek-v3
deepseek163,840$0.8 / 1M$0.8 / 1Mtool-use
deepseek/deepseek-v3.1
deepseek163,840$0.5 / 1M$1.5 / 1Mreasoning, tool-use
deepseek/deepseek-v3.1-terminus
deepseek131,072$0.28 / 1M$1 / 1Mreasoning, tool-use
deepseek/deepseek-v3.2
deepseek128,000$0.29 / 1M$0.44 / 1Mtool-use, implicit-caching
deepseek/deepseek-v3.2-thinking
deepseek128,000$0.29 / 1M$0.44 / 1Mreasoning, tool-use, implicit-caching
google/gemini-2.0-flash
google1,048,576$0.15 / 1M$0.6 / 1Mfile-input, tool-use, vision
google/gemini-2.0-flash-lite
google1,048,576$0.08 / 1M$0.3 / 1Mfile-input, tool-use, vision
google/gemini-2.5-flash
google1,000,000$0.3 / 1M$2.6 / 1Mfile-input, reasoning, tool-use
google/gemini-2.5-flash-image
google32,768$0.3 / 1M$2.6 / 1Mimage-generation
google/gemini-2.5-flash-lite
google1,048,576$0.1 / 1M$0.4 / 1Mfile-input, reasoning, tool-use
google/gemini-2.5-pro
google1,048,576$1.31 / 1M$10 / 1Mfile-input, reasoning, tool-use
google/gemini-3-flash
google1,000,000$0.5 / 1M$3 / 1Mreasoning, tool-use, file-input
google/gemini-3-pro-image
google65,536$2 / 1M$12 / 1Mimage-generation
google/gemini-3-pro-preview
google1,000,000$2 / 1M$12 / 1Mfile-input, tool-use, reasoning
google/gemini-3.1-flash-image-preview
google131,072$0.5 / 1M$3 / 1Mimage-generation, vision, reasoning
google/gemini-3.1-flash-lite-preview
google1,000,000$0.26 / 1M$1.5 / 1Mreasoning, tool-use, implicit-caching
google/gemini-3.1-pro-preview
google1,000,000$2 / 1M$12 / 1Mfile-input, tool-use, reasoning
google/gemini-embedding-001
google-$0.15 / 1M--
google/gemini-embedding-2
google-$0.2 / 1M-
google/imagen-4.0-fast-generate-001
google480--image-generation
google/imagen-4.0-generate-001
google480--image-generation
google/imagen-4.0-ultra-generate-001
google480--image-generation
google/text-embedding-005
google-$0.03 / 1M--
google/text-multilingual-embedding-002
google-$0.03 / 1M--
google/veo-3.0-fast-generate-001
google---
google/veo-3.0-generate-001
google---
google/veo-3.1-fast-generate-001
google---
google/veo-3.1-generate-001
google---
inception/mercury-2
inception128,000$0.26 / 1M$0.78 / 1Mtool-use, reasoning
inception/mercury-coder-small
inception32,000$0.26 / 1M$1 / 1Mtool-use
klingai/kling-v2.5-turbo-i2v
klingai---
klingai/kling-v2.5-turbo-t2v
klingai---
klingai/kling-v2.6-i2v
klingai---
klingai/kling-v2.6-motion-control
klingai---
klingai/kling-v2.6-t2v
klingai---
klingai/kling-v3.0-i2v
klingai---
klingai/kling-v3.0-t2v
klingai---
kwaipilot/kat-coder-pro-v1
kwaipilot256,000$0.03 / 1M$1.2 / 1Mreasoning
meituan/longcat-flash-chat
meituan128,000--tool-use
meituan/longcat-flash-thinking
meituan128,000$0.15 / 1M$1.5 / 1Mreasoning, tool-use
meituan/longcat-flash-thinking-2601
meituan32,768--reasoning
meta/llama-3.1-70b
meta128,000$0.75 / 1M$0.75 / 1Mtool-use
meta/llama-3.1-8b
meta128,000$0.1 / 1M$0.1 / 1Mtool-use
meta/llama-3.2-11b
meta128,000$0.16 / 1M$0.16 / 1Mtool-use, vision
meta/llama-3.2-1b
meta128,000$0.1 / 1M$0.1 / 1M-
meta/llama-3.2-3b
meta128,000$0.15 / 1M$0.15 / 1M-
meta/llama-3.2-90b
meta128,000$0.75 / 1M$0.75 / 1Mtool-use, vision
meta/llama-3.3-70b
meta128,000$0.75 / 1M$0.75 / 1Mtool-use
meta/llama-4-maverick
meta128,000$0.25 / 1M$1.01 / 1Mtool-use, vision
meta/llama-4-scout
meta128,000$0.17 / 1M$0.69 / 1Mtool-use, vision
minimax/minimax-m2
minimax205,000$0.3 / 1M$1.2 / 1Mreasoning, tool-use, implicit-caching
minimax/minimax-m2.1
minimax204,800$0.3 / 1M$1.2 / 1Mreasoning, tool-use, implicit-caching
minimax/minimax-m2.1-lightning
minimax204,800$0.3 / 1M$2.5 / 1Mreasoning, tool-use, implicit-caching
minimax/minimax-m2.5
minimax204,800$0.3 / 1M$1.2 / 1Mreasoning, tool-use, implicit-caching
minimax/minimax-m2.5-highspeed
minimax204,800$0.6 / 1M$2.5 / 1Mreasoning, tool-use, implicit-caching
minimax/minimax-m2.7
minimax204,800$0.3 / 1M$1.2 / 1Mreasoning, tool-use, implicit-caching
minimax/minimax-m2.7-highspeed
minimax204,800$0.6 / 1M$2.5 / 1Mreasoning, tool-use, implicit-caching
mistral/codestral
mistral128,000$0.3 / 1M$0.9 / 1Mtool-use
mistral/codestral-embed
mistral-$0.15 / 1M--
mistral/devstral-2
mistral256,000$0.4 / 1M$2 / 1Mtool-use
mistral/devstral-small
mistral128,000$0.1 / 1M$0.3 / 1Mtool-use
mistral/devstral-small-2
mistral256,000$0.1 / 1M$0.3 / 1Mtool-use
mistral/magistral-medium
mistral128,000$2 / 1M$5 / 1Mreasoning, vision
mistral/magistral-small
mistral128,000$0.5 / 1M$1.5 / 1Mreasoning, vision
mistral/ministral-14b
mistral256,000$0.2 / 1M$0.2 / 1Mvision, file-input
mistral/ministral-3b
mistral128,000$0.1 / 1M$0.1 / 1Mtool-use
mistral/ministral-8b
mistral128,000$0.15 / 1M$0.15 / 1Mtool-use
mistral/mistral-embed
mistral-$0.1 / 1M--
mistral/mistral-large-3
mistral256,000$0.5 / 1M$1.5 / 1Mvision
mistral/mistral-medium
mistral128,000$0.4 / 1M$2 / 1Mtool-use, vision
mistral/mistral-nemo
mistral128,000$0.15 / 1M$0.15 / 1M
mistral/mistral-small
mistral32,000$0.1 / 1M$0.3 / 1Mtool-use, vision
mistral/mixtral-8x22b-instruct
mistral65,536$1.2 / 1M$1.2 / 1M-
mistral/pixtral-12b
mistral128,000$0.15 / 1M$0.15 / 1Mtool-use, vision
mistral/pixtral-large
mistral128,000$2 / 1M$6 / 1Mtool-use, vision
moonshotai/kimi-k2
moonshotai131,072$0.6 / 1M$2.6 / 1Mimplicit-caching, tool-use
moonshotai/kimi-k2-0905
moonshotai256,000$0.6 / 1M$2.6 / 1Mimplicit-caching, tool-use
moonshotai/kimi-k2-thinking
moonshotai262,114$0.6 / 1M$2.6 / 1Mreasoning, tool-use, implicit-caching
moonshotai/kimi-k2-thinking-turbo
moonshotai262,114$1.2 / 1M$8 / 1Mreasoning, tool-use, implicit-caching
moonshotai/kimi-k2-turbo
moonshotai256,000$1.2 / 1M$8 / 1Mtool-use
moonshotai/kimi-k2.5
moonshotai262,114$0.6 / 1M$3 / 1Mreasoning, vision, tool-use
morph/morph-v3-fast
morph81,920$0.8 / 1M$1.2 / 1M-
morph/morph-v3-large
morph81,920$0.9 / 1M$1.9 / 1M-
nvidia/nemotron-3-nano-30b-a3b
nvidia262,144$0.05 / 1M$0.25 / 1Mreasoning
nvidia/nemotron-nano-12b-v2-vl
nvidia131,072$0.2 / 1M$0.6 / 1Mvision, reasoning, tool-use
nvidia/nemotron-nano-9b-v2
nvidia131,072$0.06 / 1M$0.24 / 1Mreasoning, tool-use
openai/gpt-3.5-turbo
openai16,385$0.5 / 1M$1.5 / 1M-
openai/gpt-3.5-turbo-instruct
openai8,192$1.5 / 1M$2 / 1M-
openai/gpt-4-turbo
openai128,000$10 / 1M$30 / 1Mtool-use, vision
openai/gpt-4.1
openai1,047,576$2 / 1M$8 / 1Mfile-input, tool-use, vision
openai/gpt-4.1-mini
openai1,047,576$0.4 / 1M$1.6 / 1Mfile-input, tool-use, vision
openai/gpt-4.1-nano
openai1,047,576$0.1 / 1M$0.4 / 1Mfile-input, tool-use, vision
openai/gpt-4o
openai128,000$2.6 / 1M$10 / 1Mfile-input, tool-use, vision
openai/gpt-4o-mini
openai128,000$0.15 / 1M$0.6 / 1Mfile-input, tool-use, vision
openai/gpt-4o-mini-search-preview
openai128,000$0.15 / 1M$0.6 / 1M
openai/gpt-5
openai400,000$1.31 / 1M$10 / 1Mfile-input, implicit-caching, reasoning
openai/gpt-5-chat
openai128,000$1.31 / 1M$10 / 1Mtool-use, implicit-caching, file-input
openai/gpt-5-codex
openai400,000$1.31 / 1M$10 / 1Mfile-input, implicit-caching, reasoning
openai/gpt-5-mini
openai400,000$0.26 / 1M$2 / 1Mfile-input, implicit-caching, reasoning
openai/gpt-5-nano
openai400,000$0.05 / 1M$0.4 / 1Mfile-input, implicit-caching, reasoning
openai/gpt-5-pro
openai400,000$15 / 1M$120 / 1Mfile-input, implicit-caching, reasoning
openai/gpt-5.1-codex
openai400,000$1.31 / 1M$10 / 1Mfile-input, tool-use, reasoning
openai/gpt-5.1-codex-max
openai400,000$1.31 / 1M$10 / 1Mreasoning, file-input, tool-use
openai/gpt-5.1-codex-mini
openai400,000$0.26 / 1M$2 / 1Mreasoning, file-input, vision
openai/gpt-5.1-instant
openai128,000$1.31 / 1M$10 / 1Mtool-use, vision, file-input
openai/gpt-5.1-thinking
openai400,000$1.31 / 1M$10 / 1Mtool-use, implicit-caching, file-input
openai/gpt-5.2
openai400,000$1.83 / 1M$14 / 1Mtool-use, vision, file-input
openai/gpt-5.2-chat
openai128,000$1.83 / 1M$14 / 1Mvision, file-input, tool-use
openai/gpt-5.2-codex
openai400,000$1.83 / 1M$14 / 1Mfile-input, tool-use, reasoning
openai/gpt-5.2-pro
openai400,000$22 / 1M$176 / 1Mtool-use, vision, implicit-caching
openai/gpt-5.3-chat
openai128,000$1.83 / 1M$14 / 1Mvision, file-input, tool-use
openai/gpt-5.3-codex
openai400,000$1.83 / 1M$14 / 1Mreasoning, tool-use, file-input
openai/gpt-5.4
openai1,050,000$2.6 / 1M$15 / 1Mreasoning, tool-use, vision
openai/gpt-5.4-mini
openai400,000$0.78 / 1M$4.7 / 1Mreasoning, tool-use, vision
openai/gpt-5.4-nano
openai400,000$0.2 / 1M$1.31 / 1Mreasoning, tool-use, implicit-caching
openai/gpt-5.4-pro
openai1,050,000$30 / 1M$180 / 1Mreasoning, tool-use, vision
openai/gpt-image-1
openai-$5 / 1M$40 / 1Mimage-generation
openai/gpt-image-1-mini
openai-$2 / 1M$8 / 1Mimage-generation
openai/gpt-image-1.5
openai-$5 / 1M$33 / 1Mimage-generation
openai/gpt-oss-120b
openai131,072$0.36 / 1M$0.78 / 1Mimplicit-caching
openai/gpt-oss-20b
openai128,000$0.07 / 1M$0.3 / 1Mreasoning, tool-use
openai/gpt-oss-safeguard-20b
openai131,072$0.08 / 1M$0.3 / 1Mreasoning, tool-use
openai/o1
openai200,000$15 / 1M$60 / 1Mfile-input, reasoning, tool-use
openai/o3
openai200,000$2 / 1M$8 / 1Mfile-input, reasoning, tool-use
openai/o3-deep-research
openai200,000$10 / 1M$40 / 1Mreasoning, file-input, tool-use
openai/o3-mini
openai200,000$1.1 / 1M$4.6 / 1Mreasoning, tool-use, implicit-caching
openai/o3-pro
openai200,000$20 / 1M$80 / 1Mreasoning, vision, file-input
openai/o4-mini
openai200,000$1.1 / 1M$4.6 / 1Mfile-input, reasoning, tool-use
openai/text-embedding-3-large
openai-$0.13 / 1M--
openai/text-embedding-3-small
openai-$0.02 / 1M--
openai/text-embedding-ada-002
openai-$0.1 / 1M--
perplexity/sonar
perplexity127,000--tool-use, vision
perplexity/sonar-pro
perplexity200,000--tool-use, vision
perplexity/sonar-reasoning-pro
perplexity127,000--reasoning
prime-intellect/intellect-3
prime-intellect131,072$0.2 / 1M$1.1 / 1Mreasoning, tool-use
prodia/flux-fast-schnell
prodia512--image-generation
recraft/recraft-v2
recraft---image-generation
recraft/recraft-v3
recraft---image-generation
recraft/recraft-v4
recraft---image-generation
recraft/recraft-v4-pro
recraft---image-generation
voyage/voyage-3-large
voyage-$0.18 / 1M--
voyage/voyage-3.5
voyage-$0.06 / 1M--
voyage/voyage-3.5-lite
voyage-$0.02 / 1M--
voyage/voyage-4
voyage32,000$0.06 / 1M-
voyage/voyage-4-large
voyage32,000$0.12 / 1M-
voyage/voyage-4-lite
voyage32,000$0.02 / 1M-
voyage/voyage-code-2
voyage-$0.12 / 1M--
voyage/voyage-code-3
voyage-$0.18 / 1M--
voyage/voyage-finance-2
voyage-$0.12 / 1M--
voyage/voyage-law-2
voyage-$0.12 / 1M--
xai/grok-2-vision
xai32,768$2 / 1M$10 / 1Mtool-use, vision
xai/grok-3
xai131,072$3 / 1M$15 / 1Mtool-use
xai/grok-3-fast
xai131,072$5 / 1M$26 / 1Mtool-use
xai/grok-3-mini
xai131,072$0.3 / 1M$0.5 / 1Mtool-use
xai/grok-3-mini-fast
xai131,072$0.6 / 1M$4 / 1Mtool-use
xai/grok-4
xai256,000$3 / 1M$15 / 1Mreasoning, tool-use, vision
xai/grok-4-fast-non-reasoning
xai2,000,000$0.2 / 1M$0.5 / 1Mtool-use, implicit-caching
xai/grok-4-fast-reasoning
xai2,000,000$0.2 / 1M$0.5 / 1Mreasoning, tool-use, implicit-caching
xai/grok-4.1-fast-non-reasoning
xai2,000,000$0.2 / 1M$0.5 / 1Mtool-use, implicit-caching
xai/grok-4.1-fast-reasoning
xai2,000,000$0.2 / 1M$0.5 / 1Mreasoning, tool-use, implicit-caching
xai/grok-4.20-multi-agent
xai2,000,000$2 / 1M$6 / 1Mreasoning, tool-use, implicit-caching
xai/grok-4.20-multi-agent-beta
xai2,000,000$2 / 1M$6 / 1Mreasoning, tool-use, implicit-caching
xai/grok-4.20-non-reasoning
xai2,000,000$2 / 1M$6 / 1Mtool-use, implicit-caching, vision
xai/grok-4.20-non-reasoning-beta
xai2,000,000$2 / 1M$6 / 1Mtool-use, implicit-caching, vision
xai/grok-4.20-reasoning
xai2,000,000$2 / 1M$6 / 1Mreasoning, vision, tool-use
xai/grok-4.20-reasoning-beta
xai2,000,000$2 / 1M$6 / 1Mreasoning, tool-use, vision
xai/grok-code-fast-1
xai256,000$0.2 / 1M$1.5 / 1Mreasoning, tool-use, implicit-caching
xai/grok-imagine-image
xai---image-generation
xai/grok-imagine-image-pro
xai---
xai/grok-imagine-video
xai---
xiaomi/mimo-v2-flash
xiaomi262,144$0.1 / 1M$0.3 / 1Mreasoning, tool-use
xiaomi/mimo-v2-pro
xiaomi1,000,000$1 / 1M$3 / 1Mreasoning, tool-use
zai/glm-4.5
zai128,000$0.6 / 1M$2.3 / 1Mreasoning, tool-use, implicit-caching
zai/glm-4.5-air
zai128,000$0.2 / 1M$1.1 / 1Mreasoning, tool-use, implicit-caching
zai/glm-4.5v
zai66,000$0.6 / 1M$1.8 / 1Mtool-use, vision, implicit-caching
zai/glm-4.6
zai200,000$0.6 / 1M$2.3 / 1Mreasoning, tool-use, implicit-caching
zai/glm-4.6v
zai128,000$0.3 / 1M$0.9 / 1Mvision, file-input, reasoning
zai/glm-4.6v-flash
zai128,000--vision, reasoning, file-input
zai/glm-4.7
zai200,000$0.6 / 1M$2.3 / 1Mreasoning, tool-use
zai/glm-4.7-flash
zai200,000$0.07 / 1M$0.4 / 1Mreasoning, tool-use
zai/glm-4.7-flashx
zai200,000$0.06 / 1M$0.4 / 1Mreasoning, tool-use, implicit-caching
zai/glm-5
zai202,800$1 / 1M$3.3 / 1Mreasoning, tool-use, implicit-caching
zai/glm-5-turbo
zai202,800$1.2 / 1M$4 / 1Mreasoning, tool-use, implicit-caching

LLM-friendly endpoints

LLMs can read structured documentation at /llm.txt. Use it for agent discovery and machine-readable setup instructions.

Quick endpoints
OpenAI-compatible surface
GET /llm.txt GET /v1/models POST /v1/chat/completions GET /v1/balance/:wallet POST /v1/deposit (x402)