Skip to main content

Anthropic (Claude)

POST https://api.anthropic.com/v1/messages
Uses SSE streaming. Tool calls arrive as tool_use content blocks. Best for: Complex reasoning, detailed instruction following, nuanced tool use, computer-use (screen interaction).
Anthropic is the only provider that supports computer-use (screen interaction). If you need Wolffish to drive a browser or desktop UI, you need an Anthropic key.

Getting an API Key

  1. Go to console.anthropic.com
  2. Sign up or log in
  3. Navigate to API Keys and create a new key
  4. Paste it into Wolffish → Settings → Models → Anthropic

Models

ModelContextModesInput / Output (per MTok)Notes
claude-fable-51MOff, High, Max10.00/10.00 / 50.00Frontier reasoning. Cached: $1.00/MTok.
claude-opus-4-81MOff, High, Max5.00/5.00 / 25.00Strong reasoning. Cached: $0.50/MTok.
claude-opus-4-71MOff, High, Max5.00/5.00 / 25.00Strong reasoning. Cached: $0.50/MTok.
claude-sonnet-4-61MOff, High, Max3.00/3.00 / 15.00Best balance of quality and cost. Cached: $0.30/MTok.
claude-opus-4-61MOff, High, Max5.00/5.00 / 25.00Reasoning. Cached: $0.50/MTok.
claude-opus-4-5-20251101200KOff, High, Max5.00/5.00 / 25.00Previous gen. Cached: $0.50/MTok.
claude-sonnet-4-5-20250929200KOff, High, Max3.00/3.00 / 15.00Previous gen. Cached: $0.30/MTok.
claude-haiku-4-5-20251001200KOff, High1.00/1.00 / 5.00Fast and cheap. Not recommended for agentic tasks. Cached: $0.10/MTok.
claude-opus-4-1-20250805200KOff, High, Max15.00/15.00 / 75.00Legacy. Cached: $1.50/MTok.

Reasoning modes

The brain icon next to the message box controls how this model reasons. Click it to cycle through the modes the selected model supports. Two separate ideas combine here:

Thinking — whether the model reasons

  • Off — the model answers immediately. Fastest and cheapest; ideal for simple, direct tasks.
  • On — the model first works through the problem in a dedicated reasoning pass before replying. Slower and uses more tokens, but markedly more accurate on multi-step, logical, or ambiguous tasks.

Effort — how hard it thinks

Only effort-capable models expose this; it applies once thinking is on.
  • High — standard reasoning depth. The right default for most agentic work.
  • Max — the model reasons longer and deeper for the hardest problems. More tokens and latency in exchange for higher quality on complex work.

Button states

StateColourMeaning
OffgrayThinking off — direct answer
OnblueThinking on — no effort control
HighpurpleThinking on, standard effort
MaxorangeThinking on, maximum effort
Each model shows only the states it genuinely supports. If a model always reasons (can’t be turned off) or has no effort control, the button reflects that and locks where there’s nothing to change. Wolffish remembers your choice per model. On Anthropic: Opus, Sonnet and Fable models support Off / High / Max; Haiku supports Off / High. On the newest models (Opus 4-8 and 4-7) thinking is adaptive — the model chooses its own depth — and Anthropic does not return the reasoning text; 4-6 and earlier stream their reasoning.
All models support tool calling with no hard tool-count limit. Anthropic uses ephemeral prefix caching for cost savings on repeated context.