marshtompcs / semdedup Goto Github PK
View Code? Open in Web Editor NEWThis project forked from facebookresearch/semdedup
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
License: Other