PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Data Privacy Python Packages

Python packages with the GitHub topic data-privacy. Sorted by relevance, with stars and monthly downloads.
Microsoft
presidio-analyzer

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

4.9M 8K 1K
Microsoft
presidio-anonymizer

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

3.5M 8K 1K
Microsoft
presidio-image-redactor

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

103K 8K 1K
ethyca
ethyca-fides

The Privacy Engineering & Compliance Framework

72K 455 90
datafog
datafog

Python SDK for PII detection and redaction in text and images, combining regex + NLP pipelines for production privacy workflows.

48K 55 13
microsoft
presidio-structured

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

15K 8K 1K
IBM
diffprivlib

Diffprivlib: The IBM Differential Privacy Library

12K 912 209
Microsoft
presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

8K 8K 1K
IFCA-Advanced-Computing
pycanon

pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.

5K 52 9
AI-SDC
acro

Tools for the Semi-Automatic Checking of Research Outputs. These are tools for researchers to use as drop-in replacements for common analysis commands.

4K 23 12
martaajonees
clip-protocol

Protocol to ensure the privatization of

3K 0 1
ashutoshrana
enterprise-rag-patterns

FERPA/HIPAA/GDPR-compliant RAG patterns: identity-scoped retrieval, audit logging, and framework adapters for regulated enterprise AI

3K 1 0
bibinprathap
veritas-reason

VeritasGraph — open-source Knowledge Graph & GraphRAG framework on GitHub. Build multi-hop reasoning, ontology-aware retrieval, and verifiable attribution over your own data. Nodes, edges, RDF, linked-data — runs locally or in the cloud.

3K 279 32
Tatarinho
llm-safe-pl

[DEPRECATED — use pii-toolkit] Reversible Polish PII anonymization for LLM workflows. Successor packages: pii-veil, pii-core, pii-presidio.

1K 2 0
ethyca
fidesctl

CLI for Fides

1K 455 90
AI-SDC
aisdc

Tools for the statistical disclosure control of machine learning models

1K 37 8
Wolido
openaaas-mcp-adapter

OpenAaaS: science agent network — bring AI to your data, not your data to AI. AaaS, Agent as a Service, MCP protocol, local execution, Docker sandbox, zero-config Rust nodes.

998 9 5
dataxid
dataxid

The Synthetic Data API. Generate privacy-safe synthetic data with 5 lines of code.

995 24 8
FCA-Advanced-Computing
trasgodp

Local differential privacy mechanisms

894 2 0
zafrem
data-detector

Data-detector is a Python-based PII detection and protection framework featuring multi-language NLP support, RAG security, and data tokenization capabilities.

845 0 0
Blake104
optout

Self-hosted Python CLI that automates CCPA/CPRA opt-out requests to data brokers. Local. Open source.

765 2 0
brokenbartender
sovereign-vault

Reversible PII tokenization for LLM pipelines — send documents to cloud AI without exposing real data

663 0 0
brootware
pyredactkit

Python CLI tool to redact and un-redact sensitive data from text files. 🔐📝

614 50 7
Microsoft
presidio-image-redactor-pai-mirror

Presidio image redactor package MIRRORED FOR PAI. NOT MEANT FOR GENERAL USE.

569 8K 1K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery