Article,

Improving the Effectiveness of Collaborative Filtering on Anonymous Web Usage Data

B. Mobasher, H. Dai, T. Luo, and M. Nakagawa.
(2001)

Abstract

Recommender systems based on collaborative filtering usually require real-time comparison of users' ratings on objects. In the context of Web personalization, particularly at the early stages of a visitor's interaction with the site (i.e., before registration or authentication), recommender systems must rely on anonymous clickstream data. The lack of explicit user ratings and the sheer amount of data in such a setting pose serious challenges to standard collaborative filtering techniques in terms of scalability and performance. Offline clustering of user transactions can be used to improve the scalability of collaborative filtering; however, this is often at the cost of reduced recommendation accuracy. In this paper we study the impact of various preprocessing techniques applied to clickstream data, such as clustering, normalization, and significance filtering, on collaborative filtering. Our experimental results, performed on real usage data, indicate that with proper data preparation, the clustering-based approach to collaborative filtering can achieve dramatic improvements in terms of recommendation effectiveness, while maintaining the computational advantage over direct approaches such as the k-Nearest-Neighbor technique.

BibTeX key: mobasher01
entry type: article
booktitle: In Proceedings of the IJCAI 2001 Workshop on Intelligent Techniques for Web Personalization (ITWP01)
year: 2001
pages: 53--60

Cite this publication

@article{mobasher01,
  abstract = {Recommender systems based on collaborative filtering usually require real-time comparison of users' ratings on objects. In the context of Web personalization, particularly at the early stages of a visitor's interaction with the site (i.e., before registration or authentication), recommender systems must rely on anonymous clickstream data. The lack of explicit user ratings and the sheer amount of data in such a setting pose serious challenges to standard collaborative filtering techniques in terms of scalability and performance. Offline clustering of user transactions can be used to improve the scalability of collaborative filtering; however, this is often at the cost of reduced recommendation accuracy. In this paper we study the impact of various preprocessing techniques applied to clickstream data, such as clustering, normalization, and significance filtering, on collaborative filtering. Our experimental results, performed on real usage data, indicate that with proper data preparation, the clustering-based approach to collaborative filtering can achieve dramatic improvements in terms of recommendation effectiveness, while maintaining the computational advantage over direct approaches such as the k-Nearest-Neighbor technique.},
  added-at = {2009-06-22T17:28:38.000+0200},
  author = {Mobasher, Bamshad and Dai, Honghua and Luo, Tao and Nakagawa, Miki},
  biburl = {https://www.bibsonomy.org/bibtex/27be493d6cb1088d1190c07e8a18a626c/lefteris8},
  booktitle = {In Proceedings of the IJCAI 2001 Workshop on Intelligent Techniques for Web Personalization (ITWP01)},
  interhash = {a428e91c64d8ebfe9a02ae62fc725f63},
  intrahash = {7be493d6cb1088d1190c07e8a18a626c},
  keywords = {clustering_based collaborative_filtering scalability},
  pages = {53--60},
  timestamp = {2009-06-22T17:28:39.000+0200},
  title = {Improving the Effectiveness of Collaborative Filtering on Anonymous Web Usage Data},
  year = 2001
}
