@jaeschke

CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl

, , , , , , and . Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, (July 2021)
DOI: 10.1145/3404835.3463246

Description

CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl | Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

Links and resources

Tags

community

  • @jaeschke
  • @dblp
@jaeschke's tags highlighted