Question 1

Can I switch plans anytime?

Accepted Answer

Yes. Upgrades take effect immediately and we prorate the difference. Downgrades take effect at the next billing cycle. No fees either way.

Question 2

Are tokens really included? What about overages?

Accepted Answer

Every paid plan ships with a monthly token allowance baked into the price - 5M on Standard, 11.5M on Pro, 22M on Max. There are no surprise overage bills: when your monthly grant runs out, agents pause until you top up your wallet (rolling-over cash at $5.57 per 1M tokens) or wait for the next cycle. You stay in control.

Question 3

What happens if my token wallet hits zero mid-task?

Accepted Answer

The operation stops. No auto-upgrade, no overdraft, no end-of-month invoice. Top up the wallet with as little or as much as you want - top-up balances never expire until you use them - and the task resumes. You can also bring your own LLM provider and pay them directly, which bypasses our tokens entirely.

Question 4

How do top-ups work?

Accepted Answer

Each consumable resource - tokens, web browsing hours, ML compute hours - works like a prepaid wallet. Your plan grants a monthly balance (resets each cycle, lose-it-or-use-it). Top-ups add more on demand, and that cash rolls over as long as your account is active. When the wallet hits zero, the operation stops.

Question 5

Can I bring my own LLM and skip paying for tokens?

Accepted Answer

Yes. Connect any OpenAI-compatible provider - OpenRouter, OpenAI, Anthropic, Together, Groq, vLLM, TGI, Ollama, or your own fine-tuned deployment. You pay them directly. In settings, cancel the bundled token portion of your plan to save the cost.

Question 6

Can I share my plan with my team?

Accepted Answer

The Standard, Pro, and Max plans are single-user. For team use, the Enterprise plan supports multiple seats, shared resources, role-based access, SSO, and audit logs. Custom team pricing is available - talk to sales.

Question 7

What about VAT?

Accepted Answer

All prices on this page are VAT-inclusive (gross). EU B2B customers with a valid VAT ID and non-EU customers see the net price - we don’t charge VAT in those cases.

Question 8

What about data privacy?

Accepted Answer

Your workspace is private and yours. Your hub artifacts are private by default. We don’t train on your data. You can export everything anytime, with no egress fees on hub content.

Question 9

Do you charge for failed agent runs?

Accepted Answer

Tokens consumed by the model are billed whether the agent succeeded or not - same as every other LLM-based service. We don’t charge browser time or extra compute for failures on our end.

Question 10

Is there a free trial of paid plans?

Accepted Answer

The Free plan is open-ended - use it as long as you like, with 500K starter tokens or BYO LLM from day one. If a paid plan doesn’t work for you in the first 14 days, we’ll refund it on request, no questions asked.

Question 11

Can I run an agent 24/7?

Accepted Answer

Yes. Agents run continuously within your plan’s concurrent-agent limit - tokens and web browsing hours are the wallets to watch for long-running tasks. The dashboard shows live consumption so you can tune behavior or top up as needed.

Question 12

How does fine-tuning work? Do I pay for GPUs?

Accepted Answer

You bring your own cloud (AWS, GCP, Lambda, RunPod, etc.) via dstack or skypilot. Your cloud provider bills you directly for GPU time. We orchestrate the run, capture logs and metrics, and track lineage in the hub. No GPU markup from us.

Question 13

How are ML compute hours different from agent tokens?

Accepted Answer

ML compute hours are a separate wallet used for synthetic dataset generation and eval runs (HumanEval, MBPP, terminal-bench, SWE-bench, custom suites). Standard includes 3 hours/month, Pro 15, Max 40. Top-up rate is $2.42/hour and rolls over.

Question 14

Why bring my own cloud instead of using yours?

Accepted Answer

You probably already have credits, a preferred region, specific compliance needs, or a GPU provider relationship. Forcing you onto our infrastructure would mean marking up GPUs you could buy directly. We’d rather make money on the platform than on reselling compute.

Question 15

Can I serve my fine-tuned models?

Accepted Answer

Yes. Deploy via your own cloud (dstack/skypilot), and we orchestrate the endpoint. Your agents can call your custom models directly - no token charges from us when you use them.

Question 16

What’s the hub for?

Accepted Answer

A git-backed registry for your models, datasets, and checkpoints - like a private Hugging Face Hub. Public read for community sharing, private read/write for your work. Backed by Cloudflare R2 with no egress fees. Private storage scales with your plan: 1 GB on Free, 10 GB on Standard, 50 GB on Pro, 200 GB on Max. Need more on any plan? Extra hub storage is a recurring add-on at +$0.0605/GB/month.

Question 17

What if I need more than Max but Enterprise is overkill?

Accepted Answer

Stack add-ons. A Max plan with extra workspace storage (+$0.0605/GB/mo), extra hub storage (+$0.0605/GB/mo), and ongoing wallet top-ups covers a lot of ground before you hit Enterprise territory. Talk to us if you’re not sure.

Agents that get work done.
One bill. No surprises.

Pick a plan. Tokens included.

Enterprise

Bring your own model. Train your own model.

Use your own LLM

Fine-tune your own models

Datasets & evaluation

Internal hub

Questions, answered in plain language.

Can I switch plans anytime?

Are tokens really included? What about overages?

What happens if my token wallet hits zero mid-task?

How do top-ups work?

Can I bring my own LLM and skip paying for tokens?

Can I share my plan with my team?

What about VAT?

What about data privacy?

Do you charge for failed agent runs?

Is there a free trial of paid plans?

Can I run an agent 24/7?

How does fine-tuning work? Do I pay for GPUs?

How are ML compute hours different from agent tokens?

Why bring my own cloud instead of using yours?

Can I serve my fine-tuned models?

What’s the hub for?

What if I need more than Max but Enterprise is overkill?

Build the agent you actually want.

Agents that get work done.One bill. No surprises.

Pick a plan. Tokens included.

Enterprise

Bring your own model. Train your own model.

Use your own LLM

Fine-tune your own models

Datasets & evaluation

Internal hub

Questions, answered in plain language.

Can I switch plans anytime?

Are tokens really included? What about overages?

What happens if my token wallet hits zero mid-task?

How do top-ups work?

Can I bring my own LLM and skip paying for tokens?

Can I share my plan with my team?

What about VAT?

What about data privacy?

Do you charge for failed agent runs?

Is there a free trial of paid plans?

Can I run an agent 24/7?

How does fine-tuning work? Do I pay for GPUs?

How are ML compute hours different from agent tokens?

Why bring my own cloud instead of using yours?

Can I serve my fine-tuned models?

What’s the hub for?

What if I need more than Max but Enterprise is overkill?

Build the agent you actually want.

Agents that get work done.
One bill. No surprises.