Inproceedings

CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl

Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM (July 2021)
DOI: 10.1145/3404835.3463246