RichVSM: enRiched Vector Space Models for Folksonomies
R. Abbasi, and S. Staab. HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia, New York, NY, USA, ACM, (July 2009)
Abstract
People share millions of resources (photos, bookmarks, videos, etc.) in Folksonomies (like Flickr, Delicious, Youtube, etc.). To access and share resources, they add keywords called tags to the resources. As the tags are freely chosen keywords, it might not be possible for users to tag their resources with all the relevant tags. As a result, many resources lack sufficient number of relevant tags. The lack of relevant tags results into sparseness of data, and this sparseness of data makes many relevant resources unsearchable against user queries.
In this paper, we explore two dimensions of semantic relationships between tags, based on the context and the distribution of tags. We exploit semantic relationships between tags to reduce sparseness in Folksonomies and propose different enriched vector space models. We also propose a vector space model Best of Breed which utilizes appropriate enrichment method based on the type of the query. We evaluate the proposed methods on a large dataset of 27 million resources, 92 thousand tags and 94 million tag assignments. Experimental results show that the enriched vector space models help in improving search, especially for the rare queries which have few relevant resources in the sparse data.
%0 Conference Paper
%1 abbasi2009richvsm
%A Abbasi, Rabeeh
%A Staab, Steffen
%B HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia
%C New York, NY, USA
%D 2009
%I ACM
%K folksonomies fp056 fullPaper ht2009 information model models reduction retrieval search smoothing space sparseness tagging vector
%T RichVSM: enRiched Vector Space Models for Folksonomies
%X People share millions of resources (photos, bookmarks, videos, etc.) in Folksonomies (like Flickr, Delicious, Youtube, etc.). To access and share resources, they add keywords called tags to the resources. As the tags are freely chosen keywords, it might not be possible for users to tag their resources with all the relevant tags. As a result, many resources lack sufficient number of relevant tags. The lack of relevant tags results into sparseness of data, and this sparseness of data makes many relevant resources unsearchable against user queries.
In this paper, we explore two dimensions of semantic relationships between tags, based on the context and the distribution of tags. We exploit semantic relationships between tags to reduce sparseness in Folksonomies and propose different enriched vector space models. We also propose a vector space model Best of Breed which utilizes appropriate enrichment method based on the type of the query. We evaluate the proposed methods on a large dataset of 27 million resources, 92 thousand tags and 94 million tag assignments. Experimental results show that the enriched vector space models help in improving search, especially for the rare queries which have few relevant resources in the sparse data.
@inproceedings{abbasi2009richvsm,
abstract = {People share millions of resources (photos, bookmarks, videos, etc.) in Folksonomies (like Flickr, Delicious, Youtube, etc.). To access and share resources, they add keywords called tags to the resources. As the tags are freely chosen keywords, it might not be possible for users to tag their resources with all the relevant tags. As a result, many resources lack sufficient number of relevant tags. The lack of relevant tags results into sparseness of data, and this sparseness of data makes many relevant resources unsearchable against user queries.
In this paper, we explore two dimensions of semantic relationships between tags, based on the context and the distribution of tags. We exploit semantic relationships between tags to reduce sparseness in Folksonomies and propose different enriched vector space models. We also propose a vector space model Best of Breed which utilizes appropriate enrichment method based on the type of the query. We evaluate the proposed methods on a large dataset of 27 million resources, 92 thousand tags and 94 million tag assignments. Experimental results show that the enriched vector space models help in improving search, especially for the rare queries which have few relevant resources in the sparse data.},
added-at = {2009-06-16T15:00:02.000+0200},
address = {New York, NY, USA},
author = {Abbasi, Rabeeh and Staab, Steffen},
biburl = {https://www.bibsonomy.org/bibtex/29e7a2a323235eed80b2b91bbf2e659cd/ht09},
booktitle = {HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia},
interhash = {beeda6b9f798af218a7f51aaa399e45e},
intrahash = {9e7a2a323235eed80b2b91bbf2e659cd},
keywords = {folksonomies fp056 fullPaper ht2009 information model models reduction retrieval search smoothing space sparseness tagging vector},
month = {July},
paperid = {fp056},
publisher = {ACM},
session = {Full Paper},
timestamp = {2009-06-16T15:00:04.000+0200},
title = {RichVSM: enRiched Vector Space Models for Folksonomies},
year = 2009
}