Article,

Evolution of document networks

F. Menczer.
Proceedings of the National Academy of Sciences of the United States of America, 101 (Suppl 1): 5261--5265 (April 2004)
DOI: 10.1073/pnas.0307554100

Abstract

10.1073/pnas.0307554100 How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.

BibTeX key: citeulike:885123
entry type: article
address: School of Informatics, Indiana University, Bloomington, IN 47408, USA. fil@indiana.edu
year: 2004
month: April
journal: Proceedings of the National Academy of Sciences of the United States of America
number: Suppl 1
pages: 5261--5265
volume: 101
issn: 0027-8424
posted-at: 2009-07-14 15:36:16
citeulike-article-id: 885123
citeulike-linkout-1: http://www.pnas.org/content/101/suppl.1/5261.abstract
citeulike-linkout-2: http://www.pnas.org/content/101/suppl.1/5261.full.pdf
citeulike-linkout-3: http://view.ncbi.nlm.nih.gov/pubmed/14747653
citeulike-linkout-4: http://www.hubmed.org/display.cgi?uids=14747653
priority: 2
citeulike-linkout-0: http://dx.doi.org/10.1073/pnas.0307554100
DOI: 10.1073/pnas.0307554100
url: http://dx.doi.org/10.1073/pnas.0307554100

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{citeulike:885123, abstract = {10.1073/pnas.0307554100 How does a network of documents grow without centralized control? This question is becoming crucial as we try to explain the emergent scale-free topology of the World Wide Web and use link analysis to identify important information resources. Existing models of growing information networks have focused on the structure of links but neglected the content of nodes. Here I show that the current models fail to reproduce a critical characteristic of information networks, namely the distribution of textual similarity among linked documents. I propose a more realistic model that generates links by using both popularity and content. This model yields remarkably accurate predictions of both degree and similarity distributions in networks of web pages and scientific literature.}, added-at = {2009-07-14T16:37:11.000+0200}, address = {School of Informatics, Indiana University, Bloomington, IN 47408, USA. fil@indiana.edu}, author = {Menczer, Filippo}, biburl = {https://www.bibsonomy.org/bibtex/2a8f0c9bb4e799237ab682bf8f527e889/anneba}, citeulike-article-id = {885123}, citeulike-linkout-0 = {http://dx.doi.org/10.1073/pnas.0307554100}, citeulike-linkout-1 = {http://www.pnas.org/content/101/suppl.1/5261.abstract}, citeulike-linkout-2 = {http://www.pnas.org/content/101/suppl.1/5261.full.pdf}, citeulike-linkout-3 = {http://view.ncbi.nlm.nih.gov/pubmed/14747653}, citeulike-linkout-4 = {http://www.hubmed.org/display.cgi?uids=14747653}, description = {CiteULike: Evolution of document networks}, doi = {10.1073/pnas.0307554100}, interhash = {41b0a1a1df7211b02d6744b4589e8cd3}, intrahash = {a8f0c9bb4e799237ab682bf8f527e889}, issn = {0027-8424}, journal = {Proceedings of the National Academy of Sciences of the United States of America}, keywords = {authoring cocitation evolution network}, month = {April}, number = {Suppl 1}, pages = {5261--5265}, posted-at = {2009-07-14 15:36:16}, priority = {2}, timestamp = {2009-07-14T16:37:12.000+0200}, title = {Evolution of document networks}, url = {http://dx.doi.org/10.1073/pnas.0307554100}, volume = 101, year = 2004 }

BibSonomy

Evolution of document networks

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on