Try it out. See if it fits your work.
- up to 5 agents
- 500K starter tokens
- 1 GB workspace
- 1 GB hub storage
- 30 min web browsing
- Community support
Pick a plan, sign up, and start building. Tokens are included by default. Advanced users can bring their own LLM provider or train custom models.
Every plan ships the whole platform - runtime, hub, dev toolkit - with a generous monthly token allowance baked in. Advanced users can bring their own LLM provider and pay them directly.
All prices VAT-inclusive (gross). EU B2B with valid VAT ID and non-EU customers see the net price.
Try it out. See if it fits your work.
For personal projects and occasional automation.
For daily workflows and serious automation.
For power users, small businesses, and multi-agent setups.
Custom token volume. Dedicated compute. SSO, audit logs, RBAC. SLA, dedicated support, custom contracts. For teams of 5+, regulated industries, and ML teams shipping production models.
Contact sales →Need more tokens or browser time? Top up your wallet ↓
Available on every paid plan. Pick what you need, configure in your account settings. No GPU markup from us, no double-dipping.
Connect any OpenAI-compatible provider. You pay them directly. Cancel the bundled token portion of your plan in settings to save the cost.
Train on your own data using dstack or skypilot. Your cloud, your GPUs, your bill - we orchestrate the run, capture logs and metrics, and track lineage in the hub.
Generate synthetic datasets at scale. Run standardized or custom eval suites. ML compute hours work like the other wallet resources - plan grant resets monthly, top-ups roll over.
A git-backed registry for your models, datasets, and checkpoints - like a private Hugging Face Hub. Public read for community sharing, private read/write for your work. Backed by Cloudflare R2 - no egress fees.
Yes. Upgrades take effect immediately and we prorate the difference. Downgrades take effect at the next billing cycle. No fees either way.
Every paid plan ships with a monthly token allowance baked into the price - 5M on Standard, 11.5M on Pro, 22M on Max. There are no surprise overage bills: when your monthly grant runs out, agents pause until you top up your wallet (rolling-over cash at $5.57 per 1M tokens) or wait for the next cycle. You stay in control.
The operation stops. No auto-upgrade, no overdraft, no end-of-month invoice. Top up the wallet with as little or as much as you want - top-up balances never expire until you use them - and the task resumes. You can also bring your own LLM provider and pay them directly, which bypasses our tokens entirely.
Each consumable resource - tokens, web browsing hours, ML compute hours - works like a prepaid wallet. Your plan grants a monthly balance (resets each cycle, lose-it-or-use-it). Top-ups add more on demand, and that cash rolls over as long as your account is active. When the wallet hits zero, the operation stops.
Yes. Connect any OpenAI-compatible provider - OpenRouter, OpenAI, Anthropic, Together, Groq, vLLM, TGI, Ollama, or your own fine-tuned deployment. You pay them directly. In settings, cancel the bundled token portion of your plan to save the cost.
The Standard, Pro, and Max plans are single-user. For team use, the Enterprise plan supports multiple seats, shared resources, role-based access, SSO, and audit logs. Custom team pricing is available - talk to sales.
All prices on this page are VAT-inclusive (gross). EU B2B customers with a valid VAT ID and non-EU customers see the net price - we don’t charge VAT in those cases.
Your workspace is private and yours. Your hub artifacts are private by default. We don’t train on your data. You can export everything anytime, with no egress fees on hub content.
Tokens consumed by the model are billed whether the agent succeeded or not - same as every other LLM-based service. We don’t charge browser time or extra compute for failures on our end.
The Free plan is open-ended - use it as long as you like, with 500K starter tokens or BYO LLM from day one. If a paid plan doesn’t work for you in the first 14 days, we’ll refund it on request, no questions asked.
Yes. Agents run continuously within your plan’s concurrent-agent limit - tokens and web browsing hours are the wallets to watch for long-running tasks. The dashboard shows live consumption so you can tune behavior or top up as needed.
You bring your own cloud (AWS, GCP, Lambda, RunPod, etc.) via dstack or skypilot. Your cloud provider bills you directly for GPU time. We orchestrate the run, capture logs and metrics, and track lineage in the hub. No GPU markup from us.
ML compute hours are a separate wallet used for synthetic dataset generation and eval runs (HumanEval, MBPP, terminal-bench, SWE-bench, custom suites). Standard includes 3 hours/month, Pro 15, Max 40. Top-up rate is $2.42/hour and rolls over.
You probably already have credits, a preferred region, specific compliance needs, or a GPU provider relationship. Forcing you onto our infrastructure would mean marking up GPUs you could buy directly. We’d rather make money on the platform than on reselling compute.
Yes. Deploy via your own cloud (dstack/skypilot), and we orchestrate the endpoint. Your agents can call your custom models directly - no token charges from us when you use them.
A git-backed registry for your models, datasets, and checkpoints - like a private Hugging Face Hub. Public read for community sharing, private read/write for your work. Backed by Cloudflare R2 with no egress fees. Private storage scales with your plan: 1 GB on Free, 10 GB on Standard, 50 GB on Pro, 200 GB on Max. Need more on any plan? Extra hub storage is a recurring add-on at +$0.0605/GB/month.
Stack add-ons. A Max plan with extra workspace storage (+$0.0605/GB/mo), extra hub storage (+$0.0605/GB/mo), and ongoing wallet top-ups covers a lot of ground before you hit Enterprise territory. Talk to us if you’re not sure.
Start free with 500K tokens and no credit card. Pick a plan when you need more agents, more tokens, or your own fine-tuned models. Bring your own LLM whenever you like.