Starter
€399/mo
For small teams starting to integrate AI into their workflows.
- 5 API Keys
- Unlimited tokens · open models
- 2.5B tokens/month · SOTA models
- OpenAI API compatible
- Zero logs · EU data
- Email support
// pricing
Pay per API key, not per token. Unlimited tokens on every plan — no lock-in, cancel anytime.
Starter
€399/mo
For small teams starting to integrate AI into their workflows.
Growth
€1,299/mo
For companies scaling AI with multiple teams or agents.
Scale
€3,199/mo
For organizations with intensive AI use and advanced needs.
Enterprise
Custom
For organizations needing dedicated GPUs and custom configuration.
All plans include RPM limits and concurrency per API Key to guarantee service quality.
If your use case requires total data sovereignty, we deploy and operate the full inference stack inside your own infrastructure. Your models, your data, your prompts — they never leave your network.
talk_to_us →// pricing faq
Everything about plans, limits and billing — before you ask.
Per API key — a flat monthly price. Tokens are unlimited on open models, with no per-token charges and no usage surprises. Your CFO gets a fixed line on the P&L.
Open models (Qwen, Gemma, DeepSeek…) are unlimited on every plan. The monthly cap only applies to frontier/SOTA models, where compute is more expensive. We always reach out before any overage — never a surprise bill.
No. Plans are month-to-month and you can cancel anytime. You run on open-weight models you can always access — no vendor can deprecate your API or change pricing on you overnight.
Yes. Upgrade or downgrade at any time and changes are prorated. As your usage grows you simply move up a tier — the API and your code stay exactly the same.
Limits apply per API key as requests-per-minute and concurrency, to guarantee service quality — not on how many tokens you process. A single key can handle hundreds of millions of tokens a month.
Yes, on Enterprise: dedicated NVIDIA Blackwell hardware, custom and fine-tuned models, and full on-premise deployment inside your own datacenter. Talk to us for a custom quote.
// get started
Skip the AI infra work. Deploy your first private inference endpoint today.
Flat rate. EU data. OpenAI API compatible.
// cookies
We use strictly necessary cookies to run the site and, only with your consent, Google Analytics to understand usage. No advertising, ever — see our Cookie Policy.
// preferences