@ncrn-cornell

Beyond Pairwise: Provably Fast Algorithms for Approximate k-Way Similarity Search

, und . Advances in Neural Information Processing Systems 26, Seite 791--799. Curran Associates, Inc., (2013)

Zusammenfassung

We go beyond the notion of pairwise similarity and look into search problems with k-way similarity functions. In this paper, we focus on problems related to 3-way Jaccard similarity: R^3way= |S_1 S_2 S_3||S_1 S_2 S_3|, S_1, S_2, S_3 C, where C is a size n collection of sets (or binary vectors). We show that approximate R^3way similarity search problems admit fast algorithms with provable guarantees, analogous to the pairwise case. Our analysis and speedup guarantees naturally extend to k-way resemblance. In the process, we extend traditional framework of locality sensitive hashing (LSH) to handle higher order similarities, which could be of independent theoretical interest. The applicability of R^3way search is shown on the Google sets" application. In addition, we demonstrate the advantage of R^3way resemblance over the pairwise case in improving retrieval quality."

Links und Ressourcen

Tags

Community

  • @ncrn-cornell
  • @dblp
@ncrn-cornells Tags hervorgehoben