De-identify clinical notes without breaking analytics. Joinable tokens + round-trip re-ID + cryptographic audit trails.
Three core differentiators that no competitor offers at our price point.
Same patient across 1000 documents = same token. Enable longitudinal analysis without exposing PHI.
De-ID → Send to LLM → Get response → Re-ID. Critical for AI scribes and clinical documentation AI.
Every operation produces a signed receipt proving what went in, what came out, and what was found.
Detects all 18 HIPAA identifiers plus clinical extensions. Ages >89 automatically generalized.
Process PDF, Word, TXT, JSON, HTML, RTF, Markdown, plus native FHIR R4 and HL7v2 parsing.
Process 100+ documents per second. Pattern engine runs on CPU, no GPU required.
Upload clinical documents in any format. We extract and de-identify automatically.
Scanned or digital PDFs with text extraction
Microsoft Word documents with formatting preserved
.txt, .text, .md Markdown files
Structured data with automatic text extraction
.html, .htm web pages with tag stripping
Rich Text Format documents
Native FHIR Bundle and Resource parsing
ADT, ORU, ORM message parsing
95.4% F1 accuracy. 80% cheaper than AWS Comprehend Medical. 5-minute setup.
| Feature | Open Source Presidio, Philter |
Cloud APIs AWS, Azure |
Enterprise Private AI, JSL |
RedactiPHI |
|---|---|---|---|---|
| F1 Score | ~70-75%* | 83-91% | 96-98% | 95.4% |
| Precision | Varies widely | 85-95% | 97%+ | 95.7% |
| Recall | 53-65%* | 80-88% | 93-99% | 95.2% |
| HIPAA Compliant | You're responsible | With BAA | Yes | Yes + BAA |
| Starting Price | Free + DevOps | ~$1/GB inspect | $10k+/yr | $0 (25 docs free) |
| 5,000 docs/month | Free + your infra | ~$1,000/mo | $5,000+/mo | $299/mo |
| Setup Time | Days to weeks | Hours | Weeks to months | 5 minutes |
| Infrastructure | Self-managed | Cloud-only | On-premise required | Fully managed API |
| Developer Dashboard | None | Basic console | None | Full dashboard + analytics |
| SDKs & Libraries | DIY integration | Vendor SDKs | Contact sales | Python, Node, cURL ready |
| Re-identification | Build your own | Not available | Limited | One-click API |
| Audit Receipts | Not included | CloudTrail logs | Enterprise only | Cryptographic proof |
| Webhooks | Not included | SNS/EventBridge | Custom integration | Built-in |
Start free, scale as you grow. No hidden fees.
For testing
For indie devs
For teams
For production
For healthcare orgs
Security and compliance are foundational, not afterthoughts.
In progress. Expected 2026.
BAA available for all paid plans.
PHI never stored. Memory only.
TLS 1.3 + AES-256-GCM.
High-value integrations we're actively building.
Full-cycle PHI protection. De-identify before sending to LLM, then re-identify in responses. Cloud-hosted with audit logging.
HIPAA-compliant chatbot for health questions. True stateful conversations with 200k context window. Powered by Claude.
The ngrok for PHI. Drop-in replacement for OpenAI, Anthropic, and Gemini SDKs. One import change, automatic PHI de-identification, all local. Open source.
Bulk de-identification for clinical trials and research. Our joinable tokenization lets you link patient data across sites while maintaining privacy.
Native integrations where clinicians already work. SMART on FHIR apps for Epic, Cerner, Athenahealth, and other major EHRs.
Want early access to Redact Chat?
Join the WaitlistOne endpoint. JSON in, JSON out. Start in minutes.
# De-identify clinical text curl -X POST https://api.redact.health/api/v1/deidentify \ -H "Content-Type: application/json" \ -H "Authorization: Bearer YOUR_API_KEY" \ -d '{ "text": "Patient John Smith, DOB 01/15/1980", "policy": "safe_harbor" }' # Response { "text": "Patient [NAM_abc123], DOB 02/02/1980", "document_id": "doc-xyz789", "phi_found": 2, "phi_types": {"PATIENT_NAME": 1, "DOB": 1} }