PiiRemover surgically removes PII from any document — PDFs, scanned images, plain text — using a stacked engine of 11 detection strategies. 100% on your servers. Zero cloud exposure.
The Fields page is where you create and tune every detection rule — fields, patterns, preserve lists, and name dictionaries. The real admin UI, exactly as it looks.
Terms listed here (institution names, medicine names, and similar) are always preserved, overriding every detection rule.
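As a rough illustration of how a preserve list can take precedence over a detection rule, here is a minimal Python sketch. It is not PiiRemover's engine: the `redact` function, the name pattern, and the sample terms are all hypothetical, and it assumes a match is skipped only when it falls entirely inside a preserved term.

```python
import re

def find_spans(pattern, text):
    """Return (start, end) for every match of pattern in text."""
    return [m.span() for m in re.finditer(pattern, text)]

def redact(text, name_pattern, preserve_terms, replace_char="*"):
    # Spans covered by preserve-list terms are protected from the rule.
    protected = []
    for term in preserve_terms:
        protected += find_spans(re.escape(term), text)

    out = list(text)
    for start, end in find_spans(name_pattern, text):
        # Assumption: a match is skipped only if it lies fully inside a preserved term.
        if any(p_start <= start and end <= p_end for p_start, p_end in protected):
            continue
        out[start:end] = replace_char * (end - start)
    return "".join(out)

# Hypothetical name rule and preserve list, for illustration only.
text = "Dr. Hopkins referred the patient to Johns Hopkins Hospital."
names = r"\b(?:Johns|Hopkins)\b"
result = redact(text, names, ["Johns Hopkins Hospital"])
print(result)
```

In this sketch the surname in "Dr. Hopkins" is masked, while the same word inside the preserved institution name is left alone.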
The built-in guide, live in your browser, walks through every step with real before/after examples and direct links to each section.
Built for healthcare, legal, and financial teams who process sensitive documents at scale.
Define any number of PII field types. Each gets its own name, replacement character, priority, and stacked patterns.
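To make the idea of a field type concrete, the following Python sketch models a field as a name, a replacement character, a priority, and a stack of regex patterns. Everything here is an assumption for illustration: the `Field` class, the `sanitize` function, the sample patterns, and in particular the rule that a higher priority number wins when two fields overlap. The product's actual configuration format and overlap rules may differ.

```python
import re
from dataclasses import dataclass

@dataclass
class Field:
    name: str
    replace_char: str
    priority: int   # assumption: a higher number wins when matches overlap
    patterns: list  # stacked regex patterns; any of them may match

def sanitize(text, fields):
    out = list(text)
    claimed = [False] * len(text)
    # Visit fields from highest to lowest priority so earlier claims win.
    for field in sorted(fields, key=lambda f: f.priority, reverse=True):
        for pattern in field.patterns:
            for m in re.finditer(pattern, text):
                start, end = m.span()
                if any(claimed[start:end]):
                    continue  # already masked by a higher-priority field
                out[start:end] = field.replace_char * (end - start)
                claimed[start:end] = [True] * (end - start)
    return "".join(out)

# Hypothetical field definitions, for illustration only.
fields = [
    Field("SSN", "#", 10, [r"\b\d{3}-\d{2}-\d{4}\b"]),
    Field("Phone", "X", 5, [r"\b\d{3}-\d{4}\b", r"\b\d{3}-\d{3}-\d{4}\b"]),
    Field("Digits", "?", 1, [r"\d{4}"]),
]
sanitized = sanitize("SSN 123-45-6789, call 555-0100 in 2024", fields)
print(sanitized)
```

The low-priority `Digits` field would also match the last four digits of the SSN and the phone number, but those characters are already claimed, so only the standalone `2024` receives its replacement character.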
Runs entirely on your infrastructure. No document ever leaves your network.
From clean digital PDFs to scanned paper. Dual OCR with automatic fallback.
Scheduled backups run in the background. Browse, download, or restore any backup point instantly.
Every API call logged. 30-day call chart, top matched fields, per-client breakdown.
Import name lists of 80,000+ entries. Scope limits matching to the document header, so common words in the body text are not mistaken for names.
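The effect of header scoping can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the "header" is taken to be the first N lines of the document, and `redact_names`, `header_lines`, and the sample names are hypothetical. How PiiRemover actually delimits the header is not described here.

```python
import re

def redact_names(text, names, header_lines=5, replace_char="*"):
    """Mask dictionary names, but only within the first header_lines lines."""
    if not names:
        return text
    pattern = re.compile(r"\b(?:" + "|".join(map(re.escape, names)) + r")\b")
    lines = text.split("\n")
    # Assumption: the header is simply the first N lines of the document.
    header = [
        pattern.sub(lambda m: replace_char * len(m.group()), line)
        for line in lines[:header_lines]
    ]
    # Body lines are passed through untouched.
    return "\n".join(header + lines[header_lines:])

# "Mark" is a name in the header but an ordinary verb in the body.
doc = "Patient: Rose Mark\nWard 4\nMark the incision site before surgery."
scoped = redact_names(doc, ["Rose", "Mark"], header_lines=2)
print(scoped)
```

Without the scope, the verb "Mark" in the third line would be masked as if it were a name.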
POST a file, receive sanitized text with full match metadata. Any language, any platform.
Issue separate keys for every system. Revoke, rotate, or audit each client independently from the admin panel.
Every response includes startIndex, length, field name, matched text, and replacement — your systems know exactly what and where.
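A client can use this metadata to audit or reproduce the redaction. The Python sketch below assumes a particular response shape that is not confirmed by the product: the key names `field`, `matchedText`, and `replacement` are guesses (only `startIndex` and `length` are named above), and `startIndex` is assumed to be a zero-based offset into the original text. The sample data is made up.

```python
from collections import Counter

def apply_matches(original, matches):
    """Rebuild the sanitized text from the original text and match metadata."""
    text = original
    # Apply from right to left so earlier offsets stay valid.
    for m in sorted(matches, key=lambda m: m["startIndex"], reverse=True):
        start = m["startIndex"]
        end = start + m["length"]
        # Sanity check: the metadata must agree with the original text.
        if text[start:end] != m["matchedText"]:
            raise ValueError(f"match at {start} does not line up with the text")
        text = text[:start] + m["replacement"] + text[end:]
    return text

# Hypothetical match metadata; key names other than startIndex/length are assumed.
original = "Patient Jane Doe, phone 555-0100."
matches = [
    {"startIndex": 8, "length": 8, "field": "Name",
     "matchedText": "Jane Doe", "replacement": "********"},
    {"startIndex": 24, "length": 8, "field": "Phone",
     "matchedText": "555-0100", "replacement": "XXXXXXXX"},
]

rebuilt = apply_matches(original, matches)
per_field = Counter(m["field"] for m in matches)
print(rebuilt)
print(dict(per_field))
```

Applying matches from the end of the text backwards keeps the sketch correct even if a replacement is longer or shorter than the text it replaces.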
Set per-client request quotas. Every call is logged with filename, duration, matches, and timestamp. Filter the log and export it to CSV for compliance.
Interactive API documentation at /swagger — auto-generated, always accurate, no external dependency.