Plans

Simple pricing. No surprises.

Two consumer plans, both with an optional usage extension. No free tier. No annual lock-in. Cancel anytime.

Pro

$20 per month

For engineers who want Cala for daily problem-solving.

200k Opus-equivalent output tokens per month
30k tokens per 24h window
Access to all frontier models (Opus 4.7, GPT-5.5, Gemini 3.5 Flash, and more)
Local knowledge base on your machine
Overage: $5 buys an extra 50k tokens

Get started

Max

$100 per month

For heavy users who lean on Cala across every project.

1M Opus-equivalent output tokens per month
150k tokens per 24h window
Access to all frontier models
Local knowledge base on your machine
Overage: $20 buys an extra 250k tokens

Get started

All limits are measured in Opus-equivalent output tokens. Cheaper models stretch the budget further: Sonnet 4.6 ~5x, Gemini 3.5 Flash ~3x, Haiku 4.5 ~5x. At ~5x, for example, Pro's 200k monthly budget covers roughly 1M Sonnet 4.6 output tokens.
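To make the arithmetic concrete, here is a minimal Python sketch of how the budget math above might work. It is an illustration, not Cala's actual metering: the function names are hypothetical, and it assumes that Opus 4.7 counts at 1x, that a "~5x" model consumes one fifth of an Opus-equivalent token per output token, and that overage is sold in whole packs.

```python
import math

# Stretch factors from the note above (approximate, "~" in the source).
# Treating Opus 4.7 as the 1x baseline is an assumption.
STRETCH = {
    "Opus 4.7": 1,
    "Sonnet 4.6": 5,
    "Gemini 3.5 Flash": 3,
    "Haiku 4.5": 5,
}

# Plan figures as listed on this page.
PLANS = {
    "Pro": {"monthly": 200_000, "pack_usd": 5, "pack_tokens": 50_000},
    "Max": {"monthly": 1_000_000, "pack_usd": 20, "pack_tokens": 250_000},
}


def opus_equivalent(model: str, output_tokens: int) -> float:
    """Convert raw output tokens into Opus-equivalent tokens (hypothetical helper)."""
    return output_tokens / STRETCH[model]


def overage_cost(plan: str, used_opus_equivalent: float) -> int:
    """Dollars of overage needed for a month's usage.

    Assumes overage is bought in whole packs; the page does not say
    whether partial packs exist.
    """
    p = PLANS[plan]
    excess = max(0.0, used_opus_equivalent - p["monthly"])
    packs = math.ceil(excess / p["pack_tokens"])
    return packs * p["pack_usd"]


if __name__ == "__main__":
    # 1M Sonnet 4.6 output tokens consume the whole Pro monthly budget.
    print(opus_equivalent("Sonnet 4.6", 1_000_000))
    # 260k Opus-equivalent tokens on Pro is 60k over, so two packs.
    print(overage_cost("Pro", 260_000))
```

Under these assumptions, a Pro user who stays on Sonnet 4.6 or Haiku 4.5 all month could generate about 1M output tokens before needing an overage pack.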

Deployment

Deployed where your IP requires.

Pick the level. Three options, each auditable and traceable, with your choice of which model runs the reasoning.

Level 1

On your own servers.

Air-gapped or in your data center. Your data never leaves your physical infrastructure. The only option for top-tier fabs and vendors.

Level 2

In your private cloud.

Inside your existing AWS, Azure, or GCP account. Your data, your account, your security boundary. We deploy and configure.

Level 3

Managed by Cala.

Hosted single-tenant for teams that need to start fast. Your encryption keys. Your audit logs. Migrate to Level 1 or 2 whenever you are ready.

Engineers shouldn't have to choose between using AI and protecting their company's IP. We've built the boring answer to that.

Enterprise

Beyond the desktop.

For organizations that need on-prem deployment, IP boundaries, SSO, audit logs, custom model routing, or local model support. We work directly with fabs and vendors who can't put a single prompt on the public internet. Reach out to talk through what you need.

Talk to us about enterprise