Coming Soon
The following features are in development and not yet publicly available. They will be released in future updates.
Compliance Modes
Currently available modes: MASKED and PROPRIETARY (Guardian Layer). The modes described below are planned.
GDPR
Replaces detected PII with typed placeholders aligned to GDPR personal data categories.
Example:
Input: "Call Sarah at (415) 555-0123 or email sarah@example.com."
Output: "Call <PERSON> at <PHONE_NUMBER> or email <EMAIL_ADDRESS>."
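The typed-placeholder idea can be illustrated with a minimal local sketch. This is not the GDPR mode's detector, which is unreleased and not documented here; the `PATTERNS` table and `mask_typed` function are hypothetical names, and the two regular expressions cover only the formats in the example above. Person names are left alone because detecting them reliably takes a trained entity recognizer rather than a pattern.

```python
import re

# Illustrative patterns only (assumption): a real detector covers many more formats.
PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "PHONE_NUMBER": re.compile(r"\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4}"),
}


def mask_typed(text: str) -> str:
    """Replace every pattern match with a <TYPE> placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text


print(mask_typed("Call Sarah at (415) 555-0123 or email sarah@example.com."))
# Call Sarah at <PHONE_NUMBER> or email <EMAIL_ADDRESS>.
```

The name "Sarah" survives in this sketch, which is the gap a model-based detector is meant to close.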
HIPAA
Applies HIPAA Safe Harbor de-identification: removes or generalizes the 18 types of identifiers listed under the Safe Harbor method, to support compliant use of health data for research and analytics.
CUSTOM
Supply your own replacement map. Each key is the original term; the value is what replaces it in the output. Useful when you want precise control over every substitution rather than automatic detection.
Planned usage:
{
  "text": "Project Nighthawk is led by Marcus Webb.",
  "compliance_mode": "CUSTOM",
  "replacements": {
    "Nighthawk": "[PROJECT]",
    "Marcus Webb": "[LEAD]"
  }
}
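To show what a replacement map of this shape would produce, here is a minimal local sketch that applies the map to the text. It assumes exact, case-sensitive substring matching; how the released CUSTOM mode will match terms (case, word boundaries, overlaps) is not specified here, and `apply_replacements` is a hypothetical helper, not part of the API.

```python
import re


def apply_replacements(text: str, replacements: dict) -> str:
    """Substitute each key in `replacements` with its value in a single pass."""
    if not replacements:
        return text
    # Try longer keys first so a long term wins over a shorter overlapping one.
    keys = sorted(replacements, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(key) for key in keys))
    return pattern.sub(lambda match: replacements[match.group(0)], text)


request = {
    "text": "Project Nighthawk is led by Marcus Webb.",
    "compliance_mode": "CUSTOM",
    "replacements": {"Nighthawk": "[PROJECT]", "Marcus Webb": "[LEAD]"},
}

print(apply_replacements(request["text"], request["replacements"]))
# Project [PROJECT] is led by [LEAD].
```

Doing the substitution in one pass means a replacement value is never itself rewritten by another key.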
Domain-Specific Tuning
Guardian Layer uses domain-tuned models to improve detection accuracy for specialized content. Currently only the General domain is available.
| Domain | Use for | Status |
|---|---|---|
| General | Business documents, general text | Available |
| Medical | Clinical notes, patient records | Coming soon |
| Finance | Banking, investment, trading documents | Coming soon |
| Custom | Specialized or mixed-domain corpora | Coming soon |
Until domain-specific models are released, use domain: "General" for all requests. The domain parameter is accepted, but every request is currently processed by the General model.
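A client can make this fallback explicit instead of relying on it silently. The sketch below mirrors the status table in a local dictionary and pins any unreleased domain to "General" when building a request body. `DOMAIN_STATUS` and `build_payload` are hypothetical names, and sending "text" and "domain" together in one body is an assumption based on the field names used on this page.

```python
# Local copy of the domain status table (must be updated by hand as domains ship).
DOMAIN_STATUS = {
    "General": "Available",
    "Medical": "Coming soon",
    "Finance": "Coming soon",
    "Custom": "Coming soon",
}


def build_payload(text: str, domain: str = "General") -> dict:
    """Build a request body, falling back to General for unreleased domains."""
    if domain not in DOMAIN_STATUS:
        raise ValueError(f"unknown domain: {domain!r}")
    if DOMAIN_STATUS[domain] != "Available":
        # Unreleased domains have no dedicated model yet, so request General.
        domain = "General"
    return {"text": text, "domain": domain}


print(build_payload("Patient reports mild headache.", domain="Medical"))
# {'text': 'Patient reports mild headache.', 'domain': 'General'}
```

Keeping the requested domain in application code, and changing only the status table later, would let a client switch to a domain-specific model once it is released.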
Image De-Identification
OCR-based detection and masking for images containing text. Detected text regions are covered with filled rectangles. Supports JPEG, PNG, BMP, and TIFF input, and returns a masked PNG.
Endpoint: POST /api/v1/deidentify/image
This endpoint is fully implemented but not yet publicly released.
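Because the endpoint is not yet released, its request format is not documented here. One step a client could prepare in advance is a pre-flight check that a file is really one of the four supported formats. The sketch below identifies the format from the file's leading bytes using the standard JPEG, PNG, BMP, and TIFF signatures; `detect_format` is a hypothetical helper, and whether the service performs a similar check is an assumption.

```python
from typing import Optional

# Leading-byte signatures for the four supported input formats.
SIGNATURES = {
    "JPEG": (b"\xff\xd8\xff",),
    "PNG": (b"\x89PNG\r\n\x1a\n",),
    "BMP": (b"BM",),
    "TIFF": (b"II*\x00", b"MM\x00*"),  # little-endian and big-endian variants
}


def detect_format(data: bytes) -> Optional[str]:
    """Return the supported format name for `data`, or None if unrecognized."""
    for name, prefixes in SIGNATURES.items():
        if data.startswith(prefixes):
            return name
    return None


print(detect_format(b"\x89PNG\r\n\x1a\n" + b"\x00" * 8))  # PNG
print(detect_format(b"GIF89a"))  # None
```

Checking the bytes rather than the file extension catches misnamed files before they are uploaded.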
Stay Updated
Follow releases and announcements at the CustodianAI Dashboard.