<rdf:RDF xmlns:burst="http://xmlns.com/burst/0.1/" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:cc="http://web.resource.org/cc/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:swrc="http://swrc.ontoware.org/ontology#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"><channel rdf:about="http://www.bibsonomy.org/burst/user/beate/features"><title>BibSonomy publications for /user/beate/features</title><link>http://www.bibsonomy.org/burst/user/beate/features</link><description>BibSonomy BuRST Feed for /user/beate/features</description><dc:date>2008-09-06T20:51:32+02:00</dc:date><items><rdf:Seq><rdf:li rdf:resource="http://www.bibsonomy.org/bibtex/2c93f4228fd8552bede071569cdaa1ad9/beate"/></rdf:Seq></items></channel><item rdf:about="http://www.bibsonomy.org/bibtex/2c93f4228fd8552bede071569cdaa1ad9/beate"><title>Detecting spam web pages through content analysis</title><description>Detecting spam web pages through content analysis</description><link>http://www.bibsonomy.org/bibtex/2c93f4228fd8552bede071569cdaa1ad9/beate</link><dc:creator>beate</dc:creator><dc:date>2008-04-07T10:41:46+02:00</dc:date><dc:subject>spam features web </dc:subject><content:encoded>&lt;span style=&#034;color:#555555;&#034;&gt;Alexandros &lt;a href=&#034;http://www.bibsonomy.org/author/Ntoulas&#034;&gt;Ntoulas&lt;/a&gt;  und Marc &lt;a href=&#034;http://www.bibsonomy.org/author/Najork&#034;&gt;Najork&lt;/a&gt;  und Mark &lt;a href=&#034;http://www.bibsonomy.org/author/Manasse&#034;&gt;Manasse&lt;/a&gt;  und Dennis &lt;a href=&#034;http://www.bibsonomy.org/author/Fetterly&#034;&gt;Fetterly&lt;/a&gt;  &lt;/span&gt;&lt;em&gt;WWW &#039;06: Proceedings of the 15th international conference on World Wide Web, &lt;/em&gt;&lt;em&gt;Seite83--92. &lt;/em&gt;&lt;em&gt;New York, NY, USA, &lt;/em&gt;&lt;em&gt;ACM, &lt;/em&gt;(&lt;em&gt;2006&lt;/em&gt;)</content:encoded><taxo:topics><rdf:Bag><rdf:li rdf:resource="http://www.bibsonomy.org/tag/spam"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/features"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/web"/></rdf:Bag></taxo:topics><burst:publication><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2c93f4228fd8552bede071569cdaa1ad9/beate"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2c93f4228fd8552bede071569cdaa1ad9/beate"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://portal.acm.org/citation.cfm?id=1135794"/><swrc:date>Mon Apr 07 10:41:46 CEST 2008</swrc:date><swrc:address>New York, NY, USA</swrc:address><swrc:booktitle>WWW &#039;06: Proceedings of the 15th international conference on World Wide Web</swrc:booktitle><swrc:pages>83--92</swrc:pages><swrc:publisher><swrc:Organization swrc:name="ACM"/></swrc:publisher><swrc:title>Detecting spam web pages through content analysis</swrc:title><swrc:year>2006</swrc:year><swrc:keywords>spam features web </swrc:keywords><swrc:abstract>In this paper, we continue our investigations of &#034;web spam&#034;: the injection of artificially-created pages into the web in order to influence the results from search engines, to drive traffic to certain pages for fun or profit. This paper considers some previously-undescribed techniques for automatically detecting spam pages, examines the effectiveness of these techniques in isolation and when aggregated using classification algorithms. When combined, our heuristics correctly identify 2,037 (86.2%) of the 2,364 spam pages (13.8%) in our judged collection of 17,168 pages, while misidentifying 526 spam and non-spam pages (3.1%).</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="Edinburgh, Scotland" swrc:key="location"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="1-59593-323-9" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="http://doi.acm.org/10.1145/1135777.1135794" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Alexandros Ntoulas"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Marc Najork"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Mark Manasse"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Dennis Fetterly"/></rdf:_4></rdf:Seq></swrc:author></rdf:Description></burst:publication></item></rdf:RDF>