Inproceedings,

Detecting spammers and content promoters in online video social networks

, , , , and .
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, page 620--627. New York, NY, USA, ACM, (2009)
DOI: 10.1145/1571941.1572047

Abstract

A number of online video social networks, out of which YouTube is the most popular, provides features that allow users to post a video as a response to a discussion topic. These features open opportunities for users to introduce polluted content, or simply pollution, into the system. For instance, <i>spammers</i> may post an unrelated video as response to a popular one aiming at increasing the likelihood of the <i>response</i> being viewed by a larger number of users. Moreover, opportunistic users--<i>promoters</i>--may try to gain visibility to a specific video by posting a large number of (potentially unrelated) responses to boost the rank of the <i>responded video</i>, making it appear in the top lists maintained by the system. Content pollution may jeopardize the trust of users on the system, thus compromising its success in promoting social interactions. In spite of that, the available literature is very limited in providing a deep understanding of this problem.</p> <p>In this paper, we go a step further by addressing the issue of detecting video spammers and promoters. Towards that end, we manually build a test collection of real YouTube users, classifying them as spammers, promoters, and legitimates. Using our test collection, we provide a characterization of social and content attributes that may help distinguish each user class. We also investigate the feasibility of using a state-of-the-art supervised classification algorithm to detect spammers and promoters, and assess its effectiveness in our test collection. We found that our approach is able to correctly identify the majority of the promoters, misclassifying only a small percentage of legitimate users. In contrast, although we are able to detect a significant fraction of spammers, they showed to be much harder to distinguish from legitimate users.

Tags

Users

  • @beate

Comments and Reviews