Y. Wang, and J. Chu. HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia, New York, NY, USA, ACM, (July 2009)
Link-based ranking algorithm is very important for current success and popular of Web Search Engine. In order to get high rank, some try to improve contents of web pages while others just put dirty tricks, such as link spam. Link spam is a trick targeting at link-based ranking algorithms by artificially created tight link structures to push some target pages get undeserved high ranks. This problem becomes even worse with the advent of wikis, blogs, forums, which are rich in links. We tackle the problem of improving link-based ranking by more fundamental viewpoint--noisy link analysis. Motivated by how non-voting hyperlinks affect quality of ranking, we propose an approach and corresponding penalty strategies to both detect and handle noisy link effectively and automatically. We also compared our approach with other related works to demonstrate that our approach is rather effective in noisy link filtering and could improve the final search results significantly.