An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
-
Updated
Sep 1, 2025 - Python
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
An AI-powered Personal Identifiable Information (PII) scanner.
Mediapipe-based library to redact faces from videos and images
Intelligent Mixture-of-Models Router for Efficient LLM Inference
Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.
A Swiss-Army-knife for your Data Intelligence platform administration.
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
π‘οΈ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs β designed to support data privacy and GDPR compliance
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
An example Next.js application protected by Arcjet.
Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
Maskwise detects, redacts, masks, and anonymizes sensitive data across text, images, and structured data in training datasets for LLM systems. Powered by Microsoft Presidio
Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface
Simple yet powerful tool for identifying and anonymizing personal information in various formats.
Anonymize / mask personal information before sending prompts to chat AI (like ChatGPT provided by OpenAI)
Web Scanner written in Python which after scanning the given URL returns it's domain name, ip address, nmap scan results and also the contents the URL's robots.txt.
Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈ
LLM Semantic Router: Intelligent Mixture-of-Models (MoM) System with Privacy Preservation and Prompt Guard. The semantic router intelligently directs OpenAI compliant API requests to the most suitable backend models based on semantic understanding of request content.
Add a description, image, and links to the pii-detection topic page so that developers can more easily learn about it.
To associate your repository with the pii-detection topic, visit your repo's landing page and select "manage topics."