> ## Documentation Index
> Fetch the complete documentation index at: https://docs.wolffi.sh/llms.txt
> Use this file to discover all available pages before exploring further.

# xAI

> Set up xAI (Grok) — strong reasoning and vision with competitive pricing

# xAI (Grok)

```
POST https://api.x.ai/v1/chat/completions
```

Uses SSE streaming with OpenAI-compatible tool-calling format. Supports vision (base64 images) and reasoning content.

**xAI offers Grok models with strong reasoning capabilities and competitive pricing.** Grok 4.3 is the flagship with 1M context, vision support, and three reasoning modes — sitting between the budget Chinese providers and the premium Western tier on price. The newer Grok Build model is purpose-built for code generation at an attractive price point.

Best for: Reasoning-heavy workflows, code generation, vision tasks, and agentic automations where you want a balance of capability and cost.

## Getting an API Key

1. Go to [console.x.ai](https://console.x.ai)
2. Sign up or log in
3. Navigate to **API Keys** and create a new key
4. Paste it into Wolffish → Settings → Models → xAI

## Models

| Model                        | Modes     | Context | Input / Output (per MTok) | Cached |
| ---------------------------- | --------- | ------- | ------------------------- | ------ |
| **grok-4.3**                 | Off, High | 1M      | $1.25 / $2.50             | —      |
| grok-4.20-0309-reasoning     | On        | 256K    | $1.25 / $2.50             | —      |
| grok-4.20-0309-non-reasoning | —         | 256K    | $1.25 / $2.50             | —      |
| grok-build-0.1               | On        | 256K    | $1.00 / $2.00             | —      |

## Reasoning modes

The **brain icon** next to the message box controls how this model reasons. Click it to cycle through the modes the selected model supports. Two separate ideas combine here:

### Thinking — *whether* the model reasons

* **Off** — the model answers immediately. Fastest and cheapest; ideal for simple, direct tasks.
* **On** — the model first works through the problem in a dedicated reasoning pass before replying. Slower and uses more tokens, but markedly more accurate on multi-step, logical, or ambiguous tasks.

### Effort — *how hard* it thinks

Only effort-capable models expose this; it applies once thinking is on.

* **High** — standard reasoning depth. The right default for most agentic work.
* **Max** — the model reasons longer and deeper for the hardest problems. More tokens and latency in exchange for higher quality on complex work.

### Button states

| State | Colour | Meaning                         |
| ----- | ------ | ------------------------------- |
| Off   | gray   | Thinking off — direct answer    |
| On    | blue   | Thinking on — no effort control |
| High  | purple | Thinking on, standard effort    |
| Max   | orange | Thinking on, maximum effort     |

Each model shows only the states it genuinely supports. If a model always reasons (can't be turned off) or has no effort control, the button reflects that and locks where there's nothing to change. Wolffish remembers your choice per model.

**On xAI:** grok-4.3 supports Off / High. grok-4.x and grok-build reason always-on, so the button stays locked on; non-reasoning grok variants have no control.

<Tip>
  Grok 4.3 is competitively priced against Anthropic and OpenAI while offering 1M context and vision. If you need a mid-tier provider with strong reasoning that's cheaper than Claude or GPT, xAI is a solid choice.
</Tip>
