Cellule-PRO · 2026 pricing

Simple pricing, young project

Annual license per pool. Unlimited queries. No per-token billing, no surprise. Performance depends on your hardware — you don't pay for cloud inference, you buy the software that orchestrates.

🎁 Pioneers — first 3 organizations get extended privileges
2 years free (vs 60 days standard BETA), then lifetime pioneer pricing: €1,000 / year for pool #1, €750 for pool #2, €500 for pool #3 and beyond. Logo + 1 public citation in exchange.
See pioneer offer →

4 plans based on your organization size

Solo / Micro

3-lawyer firm, solo doctor, accountant
1–5 users

€990
/ year · 1 pool · excl. VAT
  • 1 pool (single site)
  • Unlimited workers based on your HW
  • Up to 5 users
  • Unlimited queries
  • Chat + conversational RAG
  • Document RAG (PDF)
  • Built-in employee web UI (chat, RAG, projects) — no OpenWebUI required
  • OpenAI-compatible API (opencode, Continue, OpenWebUI if preferred)
  • Admin dashboard + GDPR self-service
  • Updates included
  • Email support (72h business hours)
  • Multi-site cluster
  • White-label branding
Request a POC

Starter

Law firm, group medical practice, SMB
5–30 users

€2,490
/ year · 1 pool · excl. VAT
  • 1 pool (single site)
  • Unlimited workers based on your HW
  • Up to 30 users
  • Unlimited queries
  • Everything in Solo, plus
  • Live topology (control room)
  • Advanced append-only audit
  • Partial white-label branding
  • Offline JWT license
  • Email support (48h business hours)
  • Multi-site cluster
Request a POC

Enterprise

Group ≥ 200 users
or strictly regulated sector

Custom
from €19,990 / year · excl. VAT
  • Unlimited pools
  • Unlimited users
  • Everything in Pro, plus
  • Contractual SLA (99.9%+)
  • 24/7 support (option)
  • Full white-label branding
  • On-site air-gap production ceremony
  • SSO integration (SAML / OIDC)
  • Joint security audit
  • EU AI Act compliance assistance
  • Source code escrow
  • Co-built roadmap
  • Custom DPA
Discuss

What you actually buy

Cellule-PRO is orchestration software, not a cloud offering. Inference runs on your existing infrastructure (idle employee desktops with CPU/GPU, or dedicated servers you already own). You pay no cloud inference cost, no tokens, no lock-in.

Performance therefore depends on your hardware: expect 15-50 tokens/second per worker depending on hardware (2B-30B models Q4_K_M, local execution). It's not GPT-4-cloud throughput, but it's fast, and it stays 100% on your premises — including when the EU AI Act asks you to account for it in August 2026.

✦ Included in all plans (even Solo)

Signed Docker image
Remote-guided installation
Interactive first-boot wizard
ADMIN_GUIDE + API_REFERENCE documentation
8 architecture HTML/SVG diagrams
GDPR articles 15-22 native
Append-only audit trail
Offline Ed25519 JWT license
Minor version updates
Zero-touch worker auto-upgrade
Guaranteed air-gap runtime
One-click GDPR export
EU AI Act compliance ready
Win / Linux / macOS workers
26 DSI-configurable admin flags
Hardened binary delivery (compiled .so for IP-critical modules)
Long-term crypto continuity (multi-key signing, graceful rotation)

60-day BETA — zero commitment

Before signing an annual license, you're entitled to a 60-day BETA. For Solo, Starter and Pro, the BETA is free (or symbolic: €200 if the deployment complexity requires hand-holding). You evaluate in real conditions with your pilot employees. Not convinced at day 60? We stop cleanly, your data stays yours.
🚀 2026 early-adopter pricing. The project is young (rc92bc in production), we personally onboard the first 20 customers. If you evaluate today, you directly influence the roadmap and benefit from a protected price for the first 3 years.

À-la-carte options

Specific needs not in your plan? Here's what we can add on demand.

On-site production ceremony

Production Ed25519 key initialization ritual on your premises, with your DSI and DPO present. 1 day.

€990 / engagement

On-site employee training

3h session on your premises, up to 20 employees. Chat, RAG, project mode, GDPR self-service.

€690 / session

SSO / LDAP integration

SAML 2.0, OIDC, internal LDAP. Unified authentication with Active Directory or Azure AD.

€1,490 setup

Custom business plugin

ERP connector, legal DMS, hospital EHR. Custom quote based on complexity.

from €1,990

Migration from Ollama / Open WebUI

Transfer your conversations, existing RAG, prompts. 2-3 day engagement.

€990 flat

Joint security audit

Review with your CISO: pentest, architecture, hardening. 2-3 engineer days.

€1,490 / day

Frequently asked questions — pricing

Why no per-token billing like OpenAI?

Because inference runs on your infrastructure, not ours. We don't know how much you consume and we don't want to. You pay an annual license, period. No end-of-month surprise.

What if we exceed the planned user count?

Tiers are indicative, not technically enforced. If you grow from 28 to 35 users mid-year, we revisit once a year at renewal. No instant blocking, no stress.

What does performance look like concretely?

On a desktop with RTX 2060 (6 GB VRAM) + 4B Q4_K_M model: 40-50 tok/s. On a workstation RTX 3090 + 30B: 15-25 tok/s. On a MacBook M2 Pro: 20-40 tok/s. Fast for chat and RAG, slower than cloud GPT-4 on long reasoning. The pool automatically caps the context based on your VRAM to avoid saturation.

What if Cellule-PRO stops the project?

Enterprise tier = source code escrow with a trusted third party (escrow.com or equivalent). If we stop, you recover the code and continue autonomously. For other tiers, the runtime is air-gap: as long as your JWT license is valid, your pool runs without external dependency.

No minimum commitment?

An annual license, that's it. If you don't renew the following year, your pool keeps running with your data but without new versions or support. No threat, no leak.