Pricing
Pay only for what you use.
Pay-as-you-go. Every API call deducts in pence against a credit balance, billed monthly to a card or invoice. No subscription required. (Self-service top-up lands during the public beta — for now we invoice in arrears against beta keys.)
| What | Rate | Unit |
|---|---|---|
Chat completions (LLM) qwen2.5-32b-instruct Blended rate — input + output tokens charged at the same rate. No separate prompt/completion pricing. | £1.00 | per 1M tokens |
Speech-to-text whisper-large-v3-turbo Billed on actual audio duration (not file size). 6-second clip = 0.1 minutes = £0.0005. | £0.005 | per audio minute |
Embeddings bge-m3 Coming soon. Pricing indicative. | £0.05 | per 1M tokens |
All prices excluding VAT. UK customers see VAT added at checkout. EU B2B customers reverse-charge per their local rules.
Private beta access
Email us, we issue an API key the same day. No card required during the beta — first month's usage is on us up to £20, billed in arrears after that.
Request beta accessSovereign Enterprise
Dedicated capacity, signed DPA, named-subprocessor disclosure, audit-log retention to your specification, 24-hour SLA. For banks, public sector, and regulated industries.
Talk to sales →What's included at every price
- Hardware located and operated in the United Kingdom
- OpenAI-compatible API surface (drop-in for any OpenAI SDK)
- Per-call audit log retained for 30 days minimum
- Streaming responses on chat completions (SSE)
- Token-level usage metering visible in dashboard
- VAT-compliant invoicing (UK + EU)