PyRank
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About

Ai Data Pipeline Python Packages

Python packages with the GitHub topic ai-data-pipeline. Sorted by relevance, with stars and monthly downloads.
liuxiaotong
knowlyr-modelaudit

LLM distillation detection & model fingerprinting via statistical forensics — behavioral probing, stylistic signatures & representation similarity. CLI + MCP ready.

1K 2 0
liuxiaotong
ai-dataset-radar

Multi-source async competitive intelligence engine for AI training data ecosystems with watermark-driven incremental scanning & anomaly detection. CLI + MCP ready.

671 2 1
liuxiaotong
knowlyr-datarecipe

Automated dataset reverse engineering framework — 6-stage analysis pipeline, LLM-enhanced cost modeling & 23+ production documents. CLI + MCP ready.

661 0 0
liuxiaotong
knowlyr-datasynth

Seed-to-scale LLM synthetic data engine with auto-detected templates, schema validation & quality-diversity optimization. CLI + MCP ready.

503 1 0
liuxiaotong
knowlyr-datacheck

Composable rule engine for LLM data quality validation with IQR/Z-score anomaly detection & auto-fix pipeline. CLI + MCP ready.

411 0 0
liuxiaotong
knowlyr-datalabel

Serverless annotation framework with LLM pre-labeling, inter-annotator agreement analysis & offline HTML interface. CLI + MCP ready.

360 0 0
liuxiaotong
knowlyr-sandbox

Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.

271 3 0
liuxiaotong
knowlyr-hub

Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.

265 3 0
liuxiaotong
knowlyr-recorder

Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.

260 3 0
liuxiaotong
knowlyr-reward

Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.

252 3 0
liuxiaotong
knowlyr-core

Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.

243 3 0
liuxiaotong
knowlyr-trainer

Gymnasium-style RL framework for LLM agent training — MDP environments, three-layer process reward & SFT/DPO/GRPO policy optimization. CLI + MCP ready.

143 3 0
liuxiaotong
knowlyr-crew

AI Skill Loader - define professional skills in Markdown, load into Claude Code and other AI IDEs via MCP

8 2 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery