<rdf:RDF xmlns:burst="http://xmlns.com/burst/0.1/" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:cc="http://web.resource.org/cc/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:swrc="http://swrc.ontoware.org/ontology#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"><channel rdf:about="http://www.bibsonomy.org/burst/user/hotho/www"><title>BibSonomy publications for /user/hotho/www</title><link>http://www.bibsonomy.org/burst/user/hotho/www</link><description>BibSonomy BuRST Feed for /user/hotho/www</description><dc:date>2008-10-07T18:40:41+02:00</dc:date><items><rdf:Seq><rdf:li rdf:resource="http://www.bibsonomy.org/bibtex/2480a63c3e6847dc8a9ebd3de040501db/hotho"/><rdf:li rdf:resource="http://www.bibsonomy.org/bibtex/27d3c70d55c118425216a7375f749c2f2/hotho"/><rdf:li rdf:resource="http://www.bibsonomy.org/bibtex/2e515dc2a8adbc7fa84b7fe968b61391e/hotho"/></rdf:Seq></items></channel><item rdf:about="http://www.bibsonomy.org/bibtex/2480a63c3e6847dc8a9ebd3de040501db/hotho"><title>Extraction and Classification of Dense Communities in the Web</title><description>WWW2007 Paper Details</description><link>http://www.bibsonomy.org/bibtex/2480a63c3e6847dc8a9ebd3de040501db/hotho</link><dc:creator>hotho</dc:creator><dc:date>2007-05-10T00:12:21+02:00</dc:date><dc:subject>graph clustering 2007 www </dc:subject><content:encoded>&lt;span style=&#034;color:#555555;&#034;&gt;Yon &lt;a href=&#034;http://www.bibsonomy.org/author/Dourisboure&#034;&gt;Dourisboure&lt;/a&gt;  und Filippo &lt;a href=&#034;http://www.bibsonomy.org/author/Geraci&#034;&gt;Geraci&lt;/a&gt;  und Marco &lt;a 
href=&#034;http://www.bibsonomy.org/author/Pellegrini&#034;&gt;Pellegrini&lt;/a&gt;  &lt;/span&gt;&lt;em&gt;Proc of the wwww, &lt;/em&gt;(&lt;em&gt;2007&lt;/em&gt;)</content:encoded><taxo:topics><rdf:Bag><rdf:li rdf:resource="http://www.bibsonomy.org/tag/graph"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/clustering"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/2007"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/www"/></rdf:Bag></taxo:topics><burst:publication><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2480a63c3e6847dc8a9ebd3de040501db/hotho"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2480a63c3e6847dc8a9ebd3de040501db/hotho"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://www2007.org/program/paper.php?id=15"/><swrc:date>Thu May 10 00:12:21 CEST 2007</swrc:date><swrc:booktitle>Proc of the wwww</swrc:booktitle><swrc:title>Extraction and Classification of Dense Communities in the Web</swrc:title><swrc:year>2007</swrc:year><swrc:keywords>graph clustering 2007 www </swrc:keywords><swrc:abstract>The World Wide Web (WWW) is rapidly becoming important for society as a medium for sharing data, information and services, and there is a growing interest in tools for understanding collective behaviors and emerging phenomena in the WWW. In this paper we focus on the problem of searching and classifying {\em communities} in the web. Loosely speaking a community is a group of pages related to a common interest. More formally communities have been associated in the computer science literature with the existence of a locally dense sub-graph of the web-graph (where web pages are nodes and hyper-links are arcs of the web-graph). The core of our contribution is a new scalable algorithm for finding relatively dense subgraphs in massive graphs. 
We apply our algorithm on web-graphs built on three publicly available large crawls of the web (with raw sizes up to 120M nodes and 1G arcs). The effectiveness of our algorithm in finding dense subgraphs is demonstrated experimentally by embedding artificial communities in the web-graph and counting how many of these are blindly found. Effectiveness increases with the size and density of the communities: it is close to 100\% for communities of thirty nodes or more (even at low density). It is still about 80\% even for communities of twenty nodes with density over $50\%$ of the arcs present. At the lower extremes the algorithm catches 35\% of dense communities made of ten nodes. We complete our Community Watch system by clustering the communities found in the web-graph into homogeneous groups by topic and labelling each group by representative keywords.</swrc:abstract><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Yon Dourisboure"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Filippo Geraci"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Marco Pellegrini"/></rdf:_3></rdf:Seq></swrc:author></rdf:Description></burst:publication></item><item rdf:about="http://www.bibsonomy.org/bibtex/27d3c70d55c118425216a7375f749c2f2/hotho"><title>Finding related pages in the World Wide Web</title><link>http://www.bibsonomy.org/bibtex/27d3c70d55c118425216a7375f749c2f2/hotho</link><dc:creator>hotho</dc:creator><dc:date>2006-01-10T13:55:26+01:00</dc:date><dc:subject>find www page search </dc:subject><content:encoded>&lt;span style=&#034;color:#555555;&#034;&gt;J. &lt;a href=&#034;http://www.bibsonomy.org/author/Dean&#034;&gt;Dean&lt;/a&gt;  und M.R. &lt;a href=&#034;http://www.bibsonomy.org/author/Henzinger&#034;&gt;Henzinger&lt;/a&gt;  &lt;/span&gt;&lt;em&gt;Proceedings of the Eighth International World Wide Web Conference WWW-1999, &lt;/em&gt;&lt;em&gt;Toronto, &lt;/em&gt;&lt;em&gt;May 1999. 
&lt;/em&gt;</content:encoded><taxo:topics><rdf:Bag><rdf:li rdf:resource="http://www.bibsonomy.org/tag/find"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/www"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/page"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/search"/></rdf:Bag></taxo:topics><burst:publication><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/27d3c70d55c118425216a7375f749c2f2/hotho"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/27d3c70d55c118425216a7375f749c2f2/hotho"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><swrc:date>Tue Jan 10 13:55:26 CET 2006</swrc:date><swrc:address>Toronto</swrc:address><swrc:booktitle>Proceedings of the Eighth International World Wide Web Conference WWW-1999</swrc:booktitle><swrc:month>May</swrc:month><swrc:title>Finding related pages in the World Wide Web</swrc:title><swrc:year>1999</swrc:year><swrc:keywords>find www page search </swrc:keywords><swrc:hasExtraField><swrc:Field swrc:value="90-74821-43-X" swrc:key="isbn"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="J. Dean"/></rdf:_1><rdf:_2><swrc:Person swrc:name="M.R. Henzinger"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description></burst:publication></item><item rdf:about="http://www.bibsonomy.org/bibtex/2e515dc2a8adbc7fa84b7fe968b61391e/hotho"><title>Data preparation for mining world wide web browsing patterns</title><link>http://www.bibsonomy.org/bibtex/2e515dc2a8adbc7fa84b7fe968b61391e/hotho</link><dc:creator>hotho</dc:creator><dc:date>2006-01-08T11:47:30+01:00</dc:date><dc:subject>mining pattern preparation data browsing www </dc:subject><content:encoded>&lt;span style=&#034;color:#555555;&#034;&gt;R. &lt;a href=&#034;http://www.bibsonomy.org/author/Cooley&#034;&gt;Cooley&lt;/a&gt;  und B. &lt;a href=&#034;http://www.bibsonomy.org/author/Mobasher&#034;&gt;Mobasher&lt;/a&gt;  und J. 
&lt;a href=&#034;http://www.bibsonomy.org/author/Srivastava&#034;&gt;Srivastava&lt;/a&gt;  &lt;/span&gt;&lt;em&gt;Journal of Knowledge and Information Systems&lt;/em&gt;&lt;em&gt;1(1):5--32&lt;/em&gt;(&lt;em&gt;1999&lt;/em&gt;)</content:encoded><taxo:topics><rdf:Bag><rdf:li rdf:resource="http://www.bibsonomy.org/tag/mining"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/pattern"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/preparation"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/data"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/browsing"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/www"/></rdf:Bag></taxo:topics><burst:publication><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2e515dc2a8adbc7fa84b7fe968b61391e/hotho"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2e515dc2a8adbc7fa84b7fe968b61391e/hotho"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><swrc:date>Sun Jan 08 11:47:30 CET 2006</swrc:date><swrc:journal>Journal of Knowledge and Information Systems</swrc:journal><swrc:number>1</swrc:number><swrc:pages>5--32</swrc:pages><swrc:title>Data preparation for mining world wide web browsing patterns</swrc:title><swrc:volume>1</swrc:volume><swrc:year>1999</swrc:year><swrc:keywords>mining pattern preparation data browsing www </swrc:keywords><swrc:hasExtraField><swrc:Field swrc:value="Santa Barbara, CA" swrc:key="location"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="R. Cooley"/></rdf:_1><rdf:_2><swrc:Person swrc:name="B. Mobasher"/></rdf:_2><rdf:_3><swrc:Person swrc:name="J. Srivastava"/></rdf:_3></rdf:Seq></swrc:author></rdf:Description></burst:publication></item></rdf:RDF>