copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Hyperincident connected components of tagging networks

N. Neubauer, and K. Obermayer. Proceedings of the 20th ACM conference on Hypertext and hypermedia, page 229--238. New York, NY, USA, ACM, (2009)
DOI: 10.1145/1557914.1557954

Abstract

Data created by social bookmarking systems can be described as 3-partite 3-uniform hypergraphs connecting documents, users, and tags (tagging networks), such that the toolbox of complex network analysis can be applied to examine their properties. One of the most basic tools, the analysis of connected components, however cannot be applied meaningfully: Tagging networks tend to be almost entirely connected. We therefore propose a generalization of connected components, m-hyperincident connected components. We show that decomposing tagging networks into 2-hyperincident connected components yields a characteristic component distribution with a salient giant component that can be found across various datasets. This pattern changes if the underlying formation process changes, for example, if the hypergraph is constructed from search logs, or if the tagging data is contaminated by spam: It turns out that the second- to 129th largest components of the spam-labeled Bibsonomy dataset are inhabited exclusively by spam users. Based on these findings, we propose and unsupervised method for spam detection.

Links and resources

BibTeX key: citeulike:5031993
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 20th ACM conference on Hypertext and hypermedia
year: 2009
pages: 229--238
publisher: ACM
series: HT '09
citeulike-article-id: 5031993
isbn: 978-1-60558-486-7
citeulike-linkout-1: http://dx.doi.org/10.1145/1557914.1557954
priority: 0
posted-at: 2009-07-01 12:40:36
citeulike-linkout-0: http://portal.acm.org/citation.cfm?id=1557914.1557954
comment: (private-note)Most interesting part - they suggested to build a different network from tagging network called 2-hyperincident, which connects tagging incidents if they are similar enough. In this network, they found that 2nd to 128th connected component are all spam. So, spam and legitimate links can be separated. However, it looks like this thing can be also found on a normal graphs - user-document. Interesting example of what could be done by link analysis. Maybe, we could use spread-activation and recommendation on this kinds of graphs.
location: Torino, Italy
DOI: 10.1145/1557914.1557954
url: http://dx.doi.org/10.1145/1557914.1557954

@aho's tags highlighted

Cite this publication

search on

Meta data

Last update 6 years ago
Created 6 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Hyperincident connected components of tagging networks

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Hyperincident connected components of tagging networks

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Hyperincident connected components of tagging networks

Comments and Reviews
(0)