@ncrn-cornell

Beyond Pairwise: Provably Fast Algorithms for Approximate k-Way Similarity Search

, and . Advances in Neural Information Processing Systems 26, page 791--799. Curran Associates, Inc., (2013)

Abstract

We go beyond the notion of pairwise similarity and look into search problems with k-way similarity functions. In this paper, we focus on problems related to 3-way Jaccard similarity: R^3way= |S_1 S_2 S_3||S_1 S_2 S_3|, S_1, S_2, S_3 C, where C is a size n collection of sets (or binary vectors). We show that approximate R^3way similarity search problems admit fast algorithms with provable guarantees, analogous to the pairwise case. Our analysis and speedup guarantees naturally extend to k-way resemblance. In the process, we extend traditional framework of locality sensitive hashing (LSH) to handle higher order similarities, which could be of independent theoretical interest. The applicability of R^3way search is shown on the Google sets" application. In addition, we demonstrate the advantage of R^3way resemblance over the pairwise case in improving retrieval quality."

Links and resources

Tags

community

  • @ncrn-cornell
  • @dblp
@ncrn-cornell's tags highlighted