AI Data Pipeline Python Packages

knowlyr-modelaudit

LLM distillation detection & model fingerprinting via statistical forensics — behavioral probing, stylistic signatures & representation similarity. CLI + MCP ready.

2K 2 0

ai-dataset-radar

Multi-source async competitive intelligence engine for AI training data ecosystems with watermark-driven incremental scanning & anomaly detection. CLI + MCP ready.

722 4 1

knowlyr-datasynth

Seed-to-scale LLM synthetic data engine with auto-detected templates, schema validation & quality-diversity optimization. CLI + MCP ready.

591 1 0

knowlyr-crew

AI skill loader: define professional skills in Markdown and load them into Claude Code and other AI IDEs via MCP.

524 2 0

knowlyr-datarecipe

Automated dataset reverse engineering framework — 6-stage analysis pipeline, LLM-enhanced cost modeling & 23+ production documents. CLI + MCP ready.

518 0 0

knowlyr-datacheck

Composable rule engine for LLM data quality validation with IQR/Z-score anomaly detection & auto-fix pipeline. CLI + MCP ready.

415 0 0

knowlyr-datalabel

Serverless annotation framework with LLM pre-labeling, inter-annotator agreement analysis & offline HTML interface. CLI + MCP ready.

361 0 0

knowlyr-hub