ON-PREMISES DEPLOYMENT
Deploy Recall on your own servers. Every document, every embedding, every query stays within your network. No cloud dependency, no external calls, no compromise on privilege.
Recall Server runs as a lightweight backend service. Clients connect from any machine on your network.
Everything in Recall Desktop, plus the infrastructure controls your IT team expects.
Deploy Recall on your office server so every attorney can access the same research. It works on your existing hardware with one simple Docker setup.
Your client files never touch the internet. Recall works entirely behind your office firewall, keeping your research truly private and protected.
Multiple attorneys on the same case. Share document folders, coordinate your research, and manage your firm's archives in one central place.
Store your firm's entire document library on your own server. Your IT team manages backups and security using the tools they already know.
Recall adapts to your office setup. Your IT team chooses the models, storage, and performance tier that fits your specific needs.
Keep data in your jurisdiction. No complex workarounds — just simple, local control over your firm's data and regulatory requirements.
Our team works directly with your IT staff to plan, configure, and validate your on-premises installation. Ongoing support included.
Recall Server scales from single-rack deployments to full data center installations.
Small to mid-size firms (5–25 users)
70B-class models on a single GPU. Fast concurrent inference for the entire firm.
Mid-size to large firms (25–100 users)
400B-class models via CPU offload or multi-node. Handles heavy concurrent loads at scale.
Large organizations (100+ users)
Frontier-class models at full precision. Multi-GPU tensor parallelism across H100 clusters with NVLink.
Talk to our team about getting Recall running on your infrastructure.
FREE EVALUATION FOR QUALIFIED FIRMS. VOLUME LICENSING AVAILABLE.