- Social Spam Detection
- Postfix now includes the sample greylisting policy-daemon in the main release (2.1+): www.postfix.org/SMTPD_POLICY_README.html#greylist.
- Web spam pages use various techniques to achieve higher-than-deserved rankings in a search engine’s results. While human experts can identify spam, it i...Web spam pages use various techniques to achieve higher-than-deserved rankings in a search engine’s results. While human experts can identify spam, it is too expensive to manually evaluate a large number of pages. Instead, we propose techniques to semi-automatically separate reputable, good pages from spam. We first select a small set of seed pages to be evaluated by an expert. Once we manually identify the reputable seed pages, we use the link structure of the web to discover other pages that are likely to be good. In this paper we discuss possible ways to implement the seed selection and the discovery of good pages. We present results of experiments run on the World Wide Web indexed by AltaVista and evaluate the performance of our techniques. Our results show that we can effectively filter out spam from a significant fraction of the web, based on a good seed set of less than 200 sites.
- Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies, page 15:1--15:8. New York, NY, USA, ACM, (2011)
- TREC 2006 Blog Track Notebook (2006)
- 7th Conference on Computer Methods and Systems, Krakow, Poland, (November 2009)ISBN 83-916420-5-4 .
- Proc. LeGo-09: From Local Patterns to Global Models, Workshop at the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, (2009)accepted .
- IEEE Internet Computing 11(6):36-45 (2007)
- AIRWeb, page 41-48. (2009)
- AIRWeb '07: Proceedings of the 3rd international workshop on Adversarial information retrieval on the web, page 57--64. New York, NY, USA, ACM Press, (2007)
- AIRWeb '08: Proceedings of the 4th international workshop on Adversarial information retrieval on the web, page 61--68. New York, NY, USA, ACM, (2008)
- CEAS, (2005)
- (2004)
- (2005)
- DocEng '06: Proceedings of the 2006 ACM symposium on Document engineering, page 107--114. New York, NY, USA, ACM Press, (2006)
- Stanford Univ., (2005)
- ECML, page 96-107. (2005)
- (2005)
- University of California, Los Angeles, (February 2004)
- VLDB, page 576-587. (2004)


user