Xiaomi MiMo (Budget-Friendly)

POST https://api.xiaomimimo.com/v1/chat/completions

Uses SSE streaming with OpenAI-compatible tool-calling format. Supports vision (base64 images) and PDF document input. MiMo is a budget-friendly cloud provider offering competitive models at a fraction of the cost of Anthropic and OpenAI. Following Xiaomi’s permanent price cut of up to 99% — matching DeepSeek V4 Pro rates — MiMo V2.5 Pro delivers strong reasoning and multilingual support at dramatically lower cost than frontier models. Best for: Cost-sensitive agentic workflows, multilingual tasks, multi-step tool chains, and high-volume automations where budget matters.

MiMo handles complex agentic tasks well — including long multi-step tool chains, research workflows, code generation, and autonomous automations. We recommend starting with MiMo and upgrading to Anthropic or OpenAI only if execution isn’t reliable enough for a specific workflow, or if you need computer-use (screen interaction), which only Anthropic supports. See Choosing a Provider for details.

Getting an API Key

Go to platform.xiaomimimo.com
Sign up or log in
Navigate to API Keys and create a new key
Paste it into Wolffish → Settings → Models → Xiaomi Mimo

Models

Model	Context	Modes	Input / Output (per MTok)	Cached	Notes
mimo-v2.5-pro	1M	Off, On	$0.20 /$ 2.00	Free	Best MiMo model. Strong reasoning and multilingual.
mimo-v2.5	1M	Off, On	$0.08 /$ 0.80	Free	Good balance of quality and cost.
mimo-v2-pro	256K	Off, On	$0.20 /$ 2.00	Free	Previous generation pro model.
mimo-v2-omni	256K	Off, On	$0.08 /$ 0.80	Free	Multimodal. Vision and document input.
mimo-v2-flash	256K	Off, On	$0.01 /$ 0.30	—	Ultra-cheap. Best for high-volume, low-complexity tasks.

Reasoning modes

The brain icon next to the message box controls how this model reasons. Click it to cycle through the modes the selected model supports. Two separate ideas combine here:

Thinking — whether the model reasons

Off — the model answers immediately. Fastest and cheapest; ideal for simple, direct tasks.
On — the model first works through the problem in a dedicated reasoning pass before replying. Slower and uses more tokens, but markedly more accurate on multi-step, logical, or ambiguous tasks.

Effort — how hard it thinks

Only effort-capable models expose this; it applies once thinking is on.

High — standard reasoning depth. The right default for most agentic work.
Max — the model reasons longer and deeper for the hardest problems. More tokens and latency in exchange for higher quality on complex work.

Button states

State	Colour	Meaning
Off	gray	Thinking off — direct answer
On	blue	Thinking on — no effort control
High	purple	Thinking on, standard effort
Max	orange	Thinking on, maximum effort

Each model shows only the states it genuinely supports. If a model always reasons (can’t be turned off) or has no effort control, the button reflects that and locks where there’s nothing to change. Wolffish remembers your choice per model. On MiMo: All MiMo chat models are a simple On / Off toggle — no effort tiers.

Cost Comparison

MiMo’s pricing positions it alongside DeepSeek as a budget tier provider, dramatically cheaper than Anthropic and OpenAI:

Provider	Flagship Model	Input / Output (per MTok)	Relative Cost
MiMo	mimo-v2.5-pro	$0.20 /$ 2.00	1× (baseline)
DeepSeek	deepseek-v4-pro	$0.435 /$ 0.87	~1–2×
Kimi	kimi-k2.6	$0.95 /$ 4.00	~2–5×
Anthropic	claude-opus-4.7	$5.00 /$ 25.00	12–25×
OpenAI	gpt-5.5	$5.00 /$ 30.00	15–25×

MiMo and DeepSeek are 10–25× cheaper than Anthropic and OpenAI and handle complex agentic workloads well. Start here — upgrade only if a specific workflow isn’t executing reliably, or if you need computer-use (Anthropic only).

When to Use MiMo

Good fit — most workflows:

Multi-step tool chains (file search → read → edit → commit)
Research → summarize → write workflows
Scheduled automations and daily task processing
Multilingual tasks — MiMo has strong Chinese, English, and multilingual support
Email drafting, report generation, data extraction
Conversational agents and Q&A
High-volume workloads where cost per token matters

Consider premium tier if:

Computer-use / screen interaction is needed (requires Anthropic)
You’ve tried a workflow on MiMo and execution isn’t reliable enough — upgrade to Anthropic or OpenAI for that specific task

Start with MiMo or DeepSeek as your Brain model and experiment with your actual workflows. Most users find they rarely need the premium tier. If a specific task isn’t executing reliably, switch your Brain to Anthropic or OpenAI — there’s no automatic fallback, so the swap is yours to make.

​Xiaomi MiMo (Budget-Friendly)

​Getting an API Key

​Models

​Reasoning modes

​Thinking — whether the model reasons

​Effort — how hard it thinks

​Button states

​Cost Comparison

​When to Use MiMo

Xiaomi MiMo (Budget-Friendly)

Getting an API Key

Models

Reasoning modes

Thinking — whether the model reasons

Effort — how hard it thinks

Button states

Cost Comparison

When to Use MiMo