neurips-2024
A holistic self-supervised data-cleaning strategy to detect off-topic samples, near duplicates, and label errors.