PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Pii Python Packages

Python packages with the GitHub topic pii. Sorted by relevance, with stars and monthly downloads.
Microsoft
presidio-analyzer

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

4.9M 8K 1K
Microsoft
presidio-anonymizer

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

3.5M 8K 1K
Microsoft
presidio-image-redactor

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

103K 8K 1K
wan9yu
argus-redact

Encrypt PII, not meaning. Locally.

99K 6 0
berislavlopac
sanitary

Utility to remove or replace sensitive data from complex structures.

66K 3 1
datafog
datafog

Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines for production privacy workflows.

48K 55 13
capitalone
dataprofiler

What's in your data? Extract schema, statistics and entities from datasets

42K 2K 186
microsoft
presidio-structured

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

15K 8K 1K
armurox
loggingredactor

Logging Redactor is a Python library designed to redact sensitive data in logs based on regex patterns and / or dictionary keys.

9K 6 0
Microsoft
presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

8K 8K 1K
ttarvis
hexlock

Format-preserving redaction for PII and sensitive data that works with LLMs/text-based pipelines

6K 6 0
kylemclaren
scrubadubdub

A Python package to scrub PII

6K 25 8
solentlabs
har-capture

Capture and sanitize HAR (HTTP Archive) files with deep PII removal. Perfect for support diagnostics, security reviews, and test fixtures.

6K 3 1
cloakllm
cloakllm

Python SDK — PII cloaking middleware for LLM calls (spaCy NER + regex + Ollama)

5K 0 0
seanpedrick-case
doc-redaction

Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface. Demo: https://huggingface.co/spaces/seanpedrickcase/document_redaction or with try with VLMs: https://huggingface.co/spaces/seanpedrickcase/document_redaction_vlm

3K 50 10
cloakllm
cloakllm-mcp

MCP server — CloakLLM tools for Claude Desktop and MCP clients

3K 0 0
opendsr-std
seedfaker

Deterministic synthetic data generator for realistic, correlated, and noisy test records across 68 locales. Rust CLI/Python/Node.js/Browser WASM/Go/PHP/Ruby/MCP

2K 23 0
EdyVision
pii-codex

A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)

2K 98 11
Tatarinho
llm-safe-pl

[DEPRECATED — use pii-toolkit] Reversible Polish PII anonymization for LLM workflows. Successor packages: pii-veil, pii-core, pii-presidio.

1K 2 0
tokern
piicatcher

Find PII data in databases

1K 341 98
nextaim-de
noirdoc

German-first PII redaction and pseudonymization for documents. Local by default. Reversible when you need it.

1K 4 0
rohitcoder
hawk-scanner

A powerful scanner to scan your Filesystem, S3, MySQL, Redis, Google Cloud Storage and Firebase storage for PII and sensitive data.

1K 486 51
parvathirajan
the-mask

A package to hide/mask PII information in the JSON object

888 2 1
zafrem
data-detector

Data-detector is a Python-based PII detection and protection framework featuring multi-language NLP support, RAG security, and data tokenization capabilities.

845 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery