keep me in the loop
Receive fresh thinking on AI governance, compliance, and innovation written for leaders and builders like you.
Thank you! Check your email for confirmation.
Your Model, Your rules!
CLōD gives you direct access to 25+ top-tier models, all in one flexible API.
Choose your model, then tune for price, latency, or token efficiency.
Available Models
Llama

Llama 3 70B Reference
Llama 3 8B Lite
Llama 3.1 405B
Llama 3.1 8B
Llama 3.1 8B Turbo

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Llama

Llama 3.2 3B
Llama 3.2 3B Turbo
Llama 4 Maverick
Llama 4 Maverick 17B
Llama 4 Scout
Llama 4 Scout 17B

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Deepseek

Deepseek V3
Deepseek R1
Deepseek R1 Distill Llama 70B

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Claude

Claude Sonnet 4
Claude Opus 4
Claude 3.7 Sonnet
Claude 3.5 Sonnet

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
GPT

GPT 4 Turbo
GPT 4o Mini
GPT 4o
GPT 4.1
GPT OSS 120B
GPT OSS 20B

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Gemini

Gemini 2.5 Pro
Gemini 2.5 Flash

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Grok

Grok 4
Grok 3

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Mistral

Mistral 7B Instruct v0.3

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Qwen

Qwen 3 32B

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Arcee

Arcee Coder Large
Arcee Maestro
Arcee Virtuoso Large

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Cogito

Cogito V2 109BMOE
Cogito V2 405B
Cogito V2 671B
Cogito V2 70B

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Gemma

Gemma 3N E4B

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Kimmi

Kimi K2 Instruct
Kimi K2 Thinking

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.
Marin

Marin 8B Instruct

Deployable governance your clients can adopt quickly: enforce rules, block risky outputs, maintain reliability, and provide 360° monitoring to turn policy into practice fast.

Model pricing

We are a pay-as-you-go platform, allowing you to control your budget and token usage. Governance and RAG are monthly add-ons that can be canceled at any time.
For a limited time, we are offering free access to both services!
Model (A to Z)** Input
(USD / 1M Tokens)
Output
(USD / 1M Tokens)
Arcee Coder Large $0.45 $0.72
Arcee Maestro $0.81 $2.97
Arcee Virtuoso Large $0.68 $1.08
Claude Opus 4 $13.50 $67.50
Claude Sonnet 4 $2.70 $13.50
Cogito V2 109B MOE $0.16 $0.16
Cogito V2 405B $3.15 $3.15
Cogito V2 671B $1.13 $1.13
Cogito V2 70B $0.79 $0.79
Deepseek R1* $0.50 $1.97
Deepseek R1 Distill Llama 70B $0.63 $1.26
Deepseek R1 Distill Qwen 14B $0.16 $0.16
Deepseek R1 Throughput $0.50 $1.97
Deepseek V3* $0.50 $1.51
Deepseek V3.1 $0.54 $1.53
Gemini 2.5 Flash $0.27 $2.25
Gemini 2.5 Pro $1.13 $9.00
Gemma 3N E4B $0.02 $0.04
GPT 4 Turbo $9.00 $27.00
GPT 4.1 $1.80 $7.20
GPT 4o $2.25 $9.00
GPT 4o Mini $0.14 $0.54
GPT OSS 120B $0.14 $0.54
GPT OSS 20B $0.05 $0.18
Grok 3 $2.70 $13.50
Grok 4 $2.70 $13.50
Kimi K2 Instruct $0.90 $2.70
Kimi K2 Thinking $1.08 $3.60
Llama 3 70B Reference $0.79 $0.79
Llama 3 8B Lite $0.09 $0.09
Llama 3.1 405B $3.15 $3.15
Llama 3.1 8B* $0.05 $0.07
Llama 3.1 8B Turbo $0.16 $0.16
Llama 3.2 3B $0.05 $0.05
Llama 3.3 70B* $0.54 $1.08
Llama 3.3 70B Turbo $0.79 $0.79
Llama 4 Maverick $0.20 $0.79
Llama 4 Maverick 17B $0.24 $0.77
Llama 4 Scout $0.14 $0.54
Llama 4 Scout 17B $0.16 $0.53
Marin 8B Instruct $0.16 $0.16
Qweb 3 Next 80B Thinking $0.14 $1.35
Qwen 2.5 72B Turbo $1.08 $1.08
Qwen 2.5 7B Turbo $0.27 $0.27
Qwen 2.5 VL 72B $1.76 $7.20
Qwen 3 235B Instruct $0.18 $0.54
Qwen 3 235B Thinking $0.59 $2.70
Qwen 3 235B Throughput $0.18 $0.54
Qwen 3 32B $0.36 $0.72
Qwen 3 Coder 480B $1.80 $1.80
Qwen 3 Next 80B Instruct $0.14 $1.35
* Indicates the lowest possible rate based on inference configuration.
** CLōD continuously adds the latest frontier models upon request.
Quick FAQs
How do I integrate CLoD into my workflow (SDKs, APIs)?

You can connect via direct API calls today, use the OpenAI SDK with minimal setup, and soon we’ll release official SDKs and support for open standards like MCP (Model Context Protocol), a new way to plug tools together without extra coding.

How long does setup take?

Setup takes less than 5 minutes. You simply create an account, generate an API key, and drop it into your workflow. From there, you can start making governed AI requests right away after selecting a preset rulset or customizing your own.

Can I run CLōD on my own infrastructure, or only on yours?

Today, CLōD runs as a managed service on our infrastructure. We handle the reliability, monitoring, and updates so you don’t have to. In the future, we plan to offer deployment options that let you run CLōD on your own infrastructure if needed.

What AI models are currently supported?

CLoD currently supports 26 leading models from providers including OpenAI, Anthropic, Google, Fireworks, xAI, Together, Sambanova, Cerebras, and Groq. This includes well-known models like GPT-4.1, Claude Opus 4, Gemini 2.5, Grok 4, and multiple Llama variants. We’re continuously adding new models, so you’ll always have access to the latest and best-performing options.

With CLōD
Control isn’t an afterthought.