An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Updated Oct 23, 2025 - Python
Intelligent Router for Mixture-of-Models
An AI-powered Personally Identifiable Information (PII) scanner.
MediaPipe-based library to redact faces from videos and images
Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.
A Swiss-Army-knife for your Data Intelligence platform administration.
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.
A research Python package for detecting, categorizing, and assessing the severity of personally identifiable information (PII)
🛡️ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs — designed to support data privacy and GDPR compliance
An example Next.js application protected by Arcjet.
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
SafeChat Slack Bot is an open-source project designed to enhance data security within Slack workspaces.
Metadata and data identification tool and Python library. Identifies PII, common identifiers, and language-specific identifiers, with fully customizable and flexible rules
Maskwise detects, redacts, masks, and anonymizes sensitive data across text, images, and structured data in training datasets for LLM systems. Powered by Microsoft Presidio
Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface
Simple yet powerful tool for identifying and anonymizing personal information in various formats.
A web scanner written in Python that scans a given URL and returns its domain name, IP address, nmap scan results, and the contents of the URL's robots.txt.
Anonymize / mask personal information before sending prompts to chat AI (like ChatGPT provided by OpenAI)
LLM Semantic Router: Intelligent Mixture-of-Models (MoM) System with Privacy Preservation and Prompt Guard. The semantic router intelligently directs OpenAI compliant API requests to the most suitable backend models based on semantic understanding of request content.