// 00 AI model atlas

The models we build with and how we pick them.

// //fleet · what we serve right now

…

// //which model? · pick by task

Want to play with the models live? Playground · Model arena

Flagship engines — default primaries

DeepSeek-V4 · live

For: hard decisions, document analysis, agent step planning

reasoning

Mistral Large 3 · live

For: assistant on your knowledge, customer support, content translation

chat · translation · vision

Qwen3 / Qwen3-Coder · live

For: code generation and refactoring, integrations, multilingual tasks

code

GLM-5.2 / GLM-5 · live

For: chat, quality comparisons, tasks with light reasoning

chat

Gemma 3 / Gemma 4 · live

For: summaries, classification, extraction, cheap tasks at scale

summarization · fast answers · classification

Devstral-2 · live

For: writing code, automations, agent tooling, technical integrations

code

Also served

Ministral-3 · live

For: intent routing, lead classification, field extraction, quick answers

fast answers · classification · extraction

Qwen3-VL · live

For: listing photo analysis, document reading, image description and tagging

Kimi K2.7 / K2.6 · live

For: photo analysis, image description and tagging, long-context tasks

vision

MiniMax M3 / M2.7 · live

For: mixed tasks, a backup chat engine, arena experiments

Nemotron-3 Ultra / Super · live

For: real-time interfaces, fast feedback, short answers

GPT-OSS · live

For: fallback reasoning, analytical tasks, provider independence

reasoning

Cogito-2.1 · live

For: fallback reasoning, high-volume analysis, quality comparisons

reasoning

Gemini 3 Flash · live

For: long-document analysis, whole knowledge bases in one pass

Embeddings (local)

BGE-M3

For: semantic search, RAG, data privacy — the foundation of any AI knowledge base

Integrated on demand

Llama 4

For: on-prem deployments, full control over weights, no cloud dependency

Claude Opus 4.x

For: hardest reasoning, code work, quality-critical tasks

GPT-5 / o-series

For: premium tasks where the client requires a specific provider, quality benchmarking

Claude Sonnet 4.x / Haiku

For: production assistants, code work, agents with a good quality-to-cost ratio

Grok 4 (xAI)

For: tasks needing current information, trend analysis, live research

Command A (Cohere)

For: enterprise-grade RAG, company search, assistants that must cite their sources

Phi-4 (Microsoft)

For: on-prem deployments on modest hardware, edge tasks, low unit cost

// //FAQ

How do you pick the AI model for a task?

The OpenClaw router picks the cheapest model that can handle the task, based on measured throughput, time-to-first-token, and context window. Each task type (chat, reasoning, code, vision, summarization) has a primary model and a fallback.
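
To make that selection rule concrete, here is a minimal sketch in Python. It is an illustration only, not the OpenClaw implementation: the field names, the `cost_per_mtok` price field, the thresholds, and the three example models are hypothetical, and treating the second-cheapest capable model as the fallback is an assumption.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class ModelStats:
    """Measured parameters for one served model (all values hypothetical)."""
    name: str
    cost_per_mtok: float       # assumed price per million tokens
    tokens_per_second: float   # measured throughput
    ttft_ms: float             # measured time-to-first-token
    context_window: int        # maximum context length in tokens
    tasks: frozenset           # task types the model is approved for


@dataclass(frozen=True)
class TaskRequirements:
    """What a request needs from a model (hypothetical shape)."""
    task: str
    min_tokens_per_second: float
    max_ttft_ms: float
    min_context: int


def can_carry(model: ModelStats, req: TaskRequirements) -> bool:
    """True if the model meets every measured requirement of the task."""
    return (
        req.task in model.tasks
        and model.tokens_per_second >= req.min_tokens_per_second
        and model.ttft_ms <= req.max_ttft_ms
        and model.context_window >= req.min_context
    )


def pick(
    models: Sequence[ModelStats], req: TaskRequirements
) -> Tuple[Optional[ModelStats], Optional[ModelStats]]:
    """Return (primary, fallback): the two cheapest capable models.

    Assumption: the fallback is simply the next-cheapest capable model.
    """
    capable = sorted(
        (m for m in models if can_carry(m, req)),
        key=lambda m: m.cost_per_mtok,
    )
    primary = capable[0] if capable else None
    fallback = capable[1] if len(capable) > 1 else None
    return primary, fallback


# Hypothetical fleet; names and numbers are placeholders, not measurements.
FLEET = [
    ModelStats("small-fast", 0.10, 180.0, 150.0, 32_000,
               frozenset({"chat", "summarization"})),
    ModelStats("mid-general", 0.50, 90.0, 400.0, 128_000,
               frozenset({"chat", "reasoning", "code"})),
    ModelStats("large-reasoner", 2.00, 40.0, 900.0, 256_000,
               frozenset({"reasoning", "code"})),
]

primary, fallback = pick(FLEET, TaskRequirements("chat", 50.0, 500.0, 16_000))
```

Filtering on capability first and sorting by cost second keeps the two concerns separate: a cheaper model never wins unless it already meets the latency and context requirements.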

Does my data go to the cloud?

PII is masked before anything goes to the cloud, and embeddings are computed locally with BGE-M3. Sensitive data stays inside your infrastructure, and full on-prem deployments send nothing to the cloud.
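
As a rough illustration of masking PII before a cloud call, the sketch below replaces e-mail addresses and phone numbers with placeholder tokens and restores them afterwards. The page does not describe the actual masking method, so the regular expressions, the token format, and the function names here are assumptions. A production detector would also need to cover names, addresses, and identifiers.

```python
import re
from typing import Dict, Tuple

# Deliberately simple patterns, assumed for illustration only.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def mask_pii(text: str) -> Tuple[str, Dict[str, str]]:
    """Replace e-mails and phone numbers with tokens; return text and mapping."""
    mapping: Dict[str, str] = {}

    def make_replacer(label: str):
        def _sub(match: re.Match) -> str:
            # Tokens are numbered in the order they are created.
            token = f"[{label}_{len(mapping) + 1}]"
            mapping[token] = match.group(0)
            return token
        return _sub

    masked = EMAIL.sub(make_replacer("EMAIL"), text)
    masked = PHONE.sub(make_replacer("PHONE"), masked)
    return masked, mapping


def unmask(text: str, mapping: Dict[str, str]) -> str:
    """Restore the original values in a response that contains tokens."""
    for token, original in mapping.items():
        text = text.replace(token, original)
    return text


message = "Contact anna@example.com or +48 600 100 200 about the order."
masked, mapping = mask_pii(message)
restored = unmask(masked, mapping)
```

Only `masked` would be sent to a cloud model in this sketch. The `mapping` stays local and is used to restore the original values in the response.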

How many models do you actually serve?

Today, 43 models are available through the router with measured parameters (the live “fleet” band). Frontier models (Claude Opus, GPT-5, Gemini) can also be integrated on demand.

Not sure which model fits your task? Tell us what you want to build — let's talk →