data-contamination
Local contamination checks for eval data overlap, hashes, and n-gram leakage.
Python package developed to evaluate textual overlap (N-Grams) between two volumes of text.