🎁 Pioneers — first 3 organizations get extended privileges

2 years free (vs 60 days standard BETA), then lifetime pioneer pricing: €1,000 / year for pool #1, €750 for pool #2, €500 for pool #3 and beyond. Logo + 1 public citation in exchange.

See pioneer offer →

4 plans based on your organization size

Solo / Micro

3-lawyer firm, solo doctor, accountant
1–5 users

€990

/ year · 1 pool · excl. VAT

1 pool (single site)
Unlimited workers based on your HW
Up to 5 users
Unlimited queries
Chat + conversational RAG
Document RAG (PDF)
Built-in employee web UI (chat, RAG, projects) — no OpenWebUI required
OpenAI-compatible API (opencode, Continue, OpenWebUI if preferred)
Admin dashboard + GDPR self-service
Updates included
Email support (72h business hours)
Multi-site cluster
White-label branding

Request a POC

Starter

Law firm, group medical practice, SMB
5–30 users

€2,490

/ year · 1 pool · excl. VAT

1 pool (single site)
Unlimited workers based on your HW
Up to 30 users
Unlimited queries
Everything in Solo, plus
Live topology (control room)
Advanced append-only audit
Partial white-label branding
Offline JWT license
Email support (48h business hours)
Multi-site cluster

Request a POC

Pro

Mid-cap, multi-site group, regional hospital
30–200 users

€6,990

/ year · up to 3 pools · excl. VAT

Up to 3 federated pools
Unlimited workers per pool
Up to 200 users
Unlimited queries
Everything in Starter, plus
Multi-site cluster (Ed25519 federation)
Symmetric admin HA cross-pool
Cross-pool RAG fanout (zero-data-out)
Incidents & Migrations admin workflow
24h failsafe auto-escalation
Email support (24h business hours)
1 monthly review call

Request a POC

Enterprise

Group ≥ 200 users
or strictly regulated sector

Custom

from €19,990 / year · excl. VAT

Unlimited pools
Unlimited users
Everything in Pro, plus
Contractual SLA (99.9%+)
24/7 support (option)
Full white-label branding
On-site air-gap production ceremony
SSO integration (SAML / OIDC)
Joint security audit
EU AI Act compliance assistance
Source code escrow
Co-built roadmap
Custom DPA

Discuss

What you actually buy

Cellule-PRO is orchestration software, not a cloud offering. Inference runs on your existing infrastructure (idle employee desktops with CPU/GPU, or dedicated servers you already own). You pay no cloud inference cost, no tokens, no lock-in.

Performance therefore depends on your hardware: expect 15-50 tokens/second per worker depending on hardware (2B-30B models Q4_K_M, local execution). It's not GPT-4-cloud throughput, but it's fast, and it stays 100% on your premises — including when the EU AI Act asks you to account for it in August 2026.

✦ Included in all plans (even Solo)

Signed Docker image

Remote-guided installation

Interactive first-boot wizard

ADMIN_GUIDE + API_REFERENCE documentation

8 architecture HTML/SVG diagrams

GDPR articles 15-22 native

Append-only audit trail

Offline Ed25519 JWT license

Minor version updates

Zero-touch worker auto-upgrade

Guaranteed air-gap runtime

One-click GDPR export

EU AI Act compliance ready

Win / Linux / macOS workers

26 DSI-configurable admin flags

Hardened binary delivery (compiled .so for IP-critical modules)

Long-term crypto continuity (multi-key signing, graceful rotation)

60-day BETA — zero commitment

      Before signing an annual license, you're entitled to a 60-day BETA.
      For Solo, Starter and Pro, the BETA is
      free (or symbolic: €200 if the deployment complexity requires
      hand-holding). You evaluate in real conditions with your pilot employees. Not
      convinced at day 60? We stop cleanly, your data stays yours.
    

      🚀 2026 early-adopter pricing. This is an early production
      release, and we personally onboard the first 20 customers.
      If you evaluate today, you directly influence the roadmap and benefit from a
      protected price for the first 3 years.
    

À-la-carte options

Specific needs not in your plan? Here's what we can add on demand.

Done-for-you setup — turnkey confidential-document RAG

Prefer we handle everything? We deploy the pool, ingest your files — contracts, case files, briefs, jurisprudence — into a private RAG that never leaves your LAN, and train your team. Your staff ask in plain language and get answers with source citations, no confidential document ever sent to a cloud. For organizations that want a working result without touching the setup. Adapts to law firms, healthcare, notary, industrial R&D.

from €2,490 one-off setup — on top of your plan

On-site production ceremony

Production Ed25519 key initialization ritual on your premises, with your DSI and DPO present. 1 day.

€990 / engagement

On-site employee training

3h session on your premises, up to 20 employees. Chat, RAG, project mode, GDPR self-service.

€690 / session

SSO / LDAP integration

SAML 2.0, OIDC, internal LDAP. Unified authentication with Active Directory or Azure AD.

€1,490 setup

Custom business plugin

ERP connector, legal DMS, hospital EHR. Custom quote based on complexity.

from €1,990

Migration from Ollama / Open WebUI

Transfer your conversations, existing RAG, prompts. 2-3 day engagement.

€990 flat

Joint security audit

Review with your CISO: pentest, architecture, hardening. 2-3 engineer days.

€1,490 / day

Frequently asked questions — pricing

Why no per-token billing like OpenAI?

Because inference runs on your infrastructure, not ours. We don't know how much you consume and we don't want to. You pay an annual license, period. No end-of-month surprise.

What if we exceed the planned user count?

Tiers are indicative, not technically enforced. If you grow from 28 to 35 users mid-year, we revisit once a year at renewal. No instant blocking, no stress.

What does performance look like concretely?

On a desktop with RTX 2060 (6 GB VRAM) + 4B Q4_K_M model: 40-50 tok/s. On a workstation RTX 3090 + 30B: 15-25 tok/s. On a MacBook M2 Pro: 20-40 tok/s. Fast for chat and RAG, slower than cloud GPT-4 on long reasoning. The pool automatically caps the context based on your VRAM to avoid saturation.

What if Cellule-PRO stops the project?

Enterprise tier = source code escrow with a trusted third party (escrow.com or equivalent). If we stop, you recover the code and continue autonomously. For other tiers, the runtime is air-gap: as long as your JWT license is valid, your pool runs without external dependency.

No minimum commitment?

An annual license, that's it. If you don't renew the following year, your pool keeps running with your data but without new versions or support. No threat, no leak.

Simple pricing, young project

4 plans based on your organization size

Solo / Micro

Starter

Pro

Enterprise

What you actually buy

✦ Included in all plans (even Solo)

60-day BETA — zero commitment

À-la-carte options

Done-for-you setup — turnkey confidential-document RAG

On-site production ceremony

On-site employee training

SSO / LDAP integration

Custom business plugin

Migration from Ollama / Open WebUI

Joint security audit

Frequently asked questions — pricing

Why no per-token billing like OpenAI?

What if we exceed the planned user count?

What does performance look like concretely?

What if Cellule-PRO stops the project?

No minimum commitment?