Article,

Large Scale Online Learning of Image Similarity Through Ranking

G. Chechik, V. Sharma, U. Shalit, and S. Bengio.
Journal of Machine Learning Research, (2010)
DOI: 10.1145/1756006.1756042

Abstract

Learning a measure of similarity between pairs of objects is an important generic problem in machine learning. It is particularly useful in large scale applications like searching for an image that is similar to a given image or finding videos that are relevant to a given video. In these tasks, users look for objects that are not only visually similar but also semantically related to a given object. Unfortunately, the approaches that exist today for learning such semantic similarity do not scale to large data sets. This is both because typically their CPU and storage requirements grow quadratically with the sample size, and because many methods impose complex positivity constraints on the space of learned similarity functions. The current paper presents OASIS, an Online Algorithm for Scalable Image Similarity learning that learns a bilinear similarity measure over sparse representations. OASIS is an online dual approach using the passive-aggressive family of learning algorithms with a large margin criterion and an efficient hinge loss cost. Our experiments show that OASIS is both fast and accurate at a wide range of scales: for a data set with thousands of images, it achieves better results than existing state-of-the-art methods, while being an order of magnitude faster. For large, web scale, data sets, OASIS can be trained on more than two million images from 150K text queries within 3 days on a single CPU. On this large scale data set, human evaluations showed that 35% of the ten nearest neighbors of a given test image, as found by OASIS, were semantically relevant to that image. This suggests that query independent similarity could be accurately learned even for large scale data sets that could not be handled before.

BibTeX key: chechik2010large
entry type: article
year: 2010
journal: Journal of Machine Learning Research
pages: 1109-1135
volume: 11
DOI: 10.1145/1756006.1756042

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Journal Article %1 chechik2010large %A Chechik, Gal %A Sharma, Varun %A Shalit, Uri %A Bengio, Samy %D 2010 %J Journal of Machine Learning Research %K %P 1109-1135 %R 10.1145/1756006.1756042 %T Large Scale Online Learning of Image Similarity Through Ranking %V 11 %X Learning a measure of similarity between pairs of objects is an important generic problem in machine learning. It is particularly useful in large scale applications like searching for an image that is similar to a given image or finding videos that are relevant to a given video. In these tasks, users look for objects that are not only visually similar but also semantically related to a given object. Unfortunately, the approaches that exist today for learning such semantic similarity do not scale to large data sets. This is both because typically their CPU and storage requirements grow quadratically with the sample size, and because many methods impose complex positivity constraints on the space of learned similarity functions. The current paper presents OASIS, an Online Algorithm for Scalable Image Similarity learning that learns a bilinear similarity measure over sparse representations. OASIS is an online dual approach using the passive-aggressive family of learning algorithms with a large margin criterion and an efficient hinge loss cost. Our experiments show that OASIS is both fast and accurate at a wide range of scales: for a data set with thousands of images, it achieves better results than existing state-of-the-art methods, while being an order of magnitude faster. For large, web scale, data sets, OASIS can be trained on more than two million images from 150K text queries within 3 days on a single CPU. On this large scale data set, human evaluations showed that 35% of the ten nearest neighbors of a given test image, as found by OASIS, were semantically relevant to that image. This suggests that query independent similarity could be accurately learned even for large scale data sets that could not be handled before.

@article{chechik2010large, abstract = {Learning a measure of similarity between pairs of objects is an important generic problem in machine learning. It is particularly useful in large scale applications like searching for an image that is similar to a given image or finding videos that are relevant to a given video. In these tasks, users look for objects that are not only visually similar but also semantically related to a given object. Unfortunately, the approaches that exist today for learning such semantic similarity do not scale to large data sets. This is both because typically their CPU and storage requirements grow quadratically with the sample size, and because many methods impose complex positivity constraints on the space of learned similarity functions. The current paper presents OASIS, an Online Algorithm for Scalable Image Similarity learning that learns a bilinear similarity measure over sparse representations. OASIS is an online dual approach using the passive-aggressive family of learning algorithms with a large margin criterion and an efficient hinge loss cost. Our experiments show that OASIS is both fast and accurate at a wide range of scales: for a data set with thousands of images, it achieves better results than existing state-of-the-art methods, while being an order of magnitude faster. For large, web scale, data sets, OASIS can be trained on more than two million images from 150K text queries within 3 days on a single CPU. On this large scale data set, human evaluations showed that 35% of the ten nearest neighbors of a given test image, as found by OASIS, were semantically relevant to that image. This suggests that query independent similarity could be accurately learned even for large scale data sets that could not be handled before.}, added-at = {2016-10-31T22:18:10.000+0100}, author = {Chechik, Gal and Sharma, Varun and Shalit, Uri and Bengio, Samy}, biburl = {https://www.bibsonomy.org/bibtex/2bff104a30eb8e739a55ab30238459988/nosebrain}, doi = {10.1145/1756006.1756042}, interhash = {a425b0dc6f27b67948a886e6e0e5ced8}, intrahash = {bff104a30eb8e739a55ab30238459988}, journal = {Journal of Machine Learning Research}, keywords = {}, pages = {1109-1135}, timestamp = {2016-10-31T22:18:10.000+0100}, title = {Large Scale Online Learning of Image Similarity Through Ranking}, volume = 11, year = 2010 }

BibSonomy

Large Scale Online Learning of Image Similarity Through Ranking

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on