PyRank

Data Contamination Python Packages

Python packages with the GitHub topic data-contamination. Sorted by relevance, with stars and monthly downloads.
auraoneai
contamination-audit

Local contamination checks for eval data overlap, hashes, and n-gram leakage.

332 0 0
nlx-group
overlapy

Python package developed to evaluate textual overlap (N-Grams) between two volumes of text.

87 10 2
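
Both packages above describe n-gram overlap checks between two bodies of text. As a rough illustration of that idea, here is a minimal sketch in plain Python. It is not the API of contamination-audit or overlapy. The function names (`ngrams`, `ngram_overlap`), the whitespace tokenization, and the default n of 3 are all assumptions made for this example.

```python
def ngrams(text, n):
    """Return the word-level n-grams of `text` as a list of tuples.

    Tokenization is a simple lowercase whitespace split (an assumption
    for this sketch; real tools may normalize text differently).
    """
    tokens = text.lower().split()
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_overlap(eval_text, corpus_text, n=3):
    """Fraction of distinct n-grams in `eval_text` that also occur in `corpus_text`.

    Returns 0.0 when `eval_text` has fewer than `n` tokens.
    """
    eval_grams = set(ngrams(eval_text, n))
    if not eval_grams:
        return 0.0
    corpus_grams = set(ngrams(corpus_text, n))
    return len(eval_grams & corpus_grams) / len(eval_grams)


if __name__ == "__main__":
    eval_item = "the quick brown fox jumps"
    corpus = "a quick brown fox jumps high"
    # Two of the three distinct eval trigrams also appear in the corpus.
    print(ngram_overlap(eval_item, corpus, n=3))
```

A high fraction suggests that an evaluation item may have leaked into the other text. What threshold counts as contamination depends on the data and is not specified on this page.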
    • Data from PyPI, GitHub, ClickHouse, and BigQuery