<rdf:RDF xmlns:community="http://www.bibsonomy.org/ontologies/2008/05/community#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:cc="http://web.resource.org/cc/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:swrc="http://swrc.ontoware.org/ontology#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xml:base="http://www.bibsonomy.org/user/jaeschke/information"><owl:Ontology rdf:about=""><rdfs:comment>BibSonomy publications for /user/jaeschke/information</rdfs:comment><owl:imports rdf:resource="http://swrc.ontoware.org/ontology/portal"/></owl:Ontology><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2d44d1c9a48f5b676388ffbc90c7577ba/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2d44d1c9a48f5b676388ffbc90c7577ba/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://doi.acm.org/10.1145/1379092.1379110"/><swrc:date>Mon Dec 05 18:36:22 CET 2011</swrc:date><swrc:address>New York, NY, USA</swrc:address><swrc:booktitle>Proceedings of the nineteenth ACM conference on Hypertext and hypermedia</swrc:booktitle><swrc:pages>81--88</swrc:pages><swrc:publisher><swrc:Organization swrc:name="ACM"/></swrc:publisher><swrc:title>Understanding the efficiency of social tagging systems using information theory</swrc:title><swrc:year>2008</swrc:year><swrc:keywords>collaborative folksonomy information social tagging theory </swrc:keywords><swrc:abstract>Given the rise in popularity of social tagging systems, it seems only natural to ask how efficient is the organically evolved tagging vocabulary in describing underlying document objects? Does this distributed process really provide a way to circumnavigate the traditional &#034;vocabulary problem&#034; with ontology? We analyze a social tagging site, namely del.icio.us, with information theory in order to evaluate the efficiency of this social tagging site for encoding navigation paths to information sources. We show that information theory provides a natural and interesting way to understand this efficiency - or the descriptive, encoding power of tags. Our results indicate the efficiency of tags appears to be waning. We discuss the implications of our findings and provide insight into how our methods can be used to design more usable social tagging software.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="1379110" swrc:key="acmid"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="Pittsburgh, PA, USA" swrc:key="location"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="978-1-59593-985-2" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="8" swrc:key="numpages"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1145/1379092.1379110" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Ed H. Chi"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Todd Mytkowicz"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2e64d14f3207766f4afc65983fa759ffe/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2e64d14f3207766f4afc65983fa759ffe/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://www.kde.cs.uni-kassel.de/pub/pdf/krause2008logsonomy.pdf"/><swrc:date>Thu Jan 27 12:08:28 CET 2011</swrc:date><swrc:address>New York, NY, USA</swrc:address><swrc:booktitle>HT &#039;08: Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia</swrc:booktitle><swrc:pages>157--166</swrc:pages><swrc:publisher><swrc:Organization swrc:name="ACM"/></swrc:publisher><swrc:title>Logsonomy - Social Information Retrieval with Logdata</swrc:title><swrc:year>2008</swrc:year><swrc:keywords>2008 engine information l3s logsonomy myown retrieval search wp5 analysis network sna social </swrc:keywords><swrc:abstract>Social bookmarking systems constitute an established part of the Web 2.0. In such systems users describe bookmarks by keywords called tags. The structure behind these social systems, called folksonomies, can be viewed as a tripartite hypergraph of user, tag and resource nodes. This underlying network shows specific structural properties that explain its growth and the possibility of serendipitous exploration.
Today’s search engines represent the gateway to retrieve information from the World Wide Web. Short queries typically consisting of two to three words describe a user’s information need. In response to the displayed results of the search engine, users click on the links of the result page as they expect
the answer to be of relevance. 
This clickdata can be represented as a folksonomy in which queries are descriptions of
clicked URLs. The resulting network structure, which we will term logsonomy is very
similar to the one of folksonomies. In order to find out about its properties, we analyze
the topological characteristics of the tripartite hypergraph of queries, users and bookmarks
on a large snapshot of del.icio.us and on query logs of two large search engines.
All of the three datasets show small world properties. The tagging behavior of users,
which is explained by preferential attachment of the tags in social bookmark systems, is
reflected in the distribution of single query words in search engines. We can conclude
that the clicking behaviour of search engine users based on the displayed search results
and the tagging behaviour of social bookmarking users is driven by similar dynamics.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="Pittsburgh, PA, USA" swrc:key="location"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="978-1-59593-985-2" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="17" swrc:key="vgwort"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1145/1379092.1379123" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Beate Krause"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Robert Jäschke"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Andreas Hotho"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Gerd Stumme"/></rdf:_4></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2a25702677dc406b1be7878215277050c/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2a25702677dc406b1be7878215277050c/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#PhDThesis"/><swrc:date>Wed Jan 26 12:03:05 CET 2011</swrc:date><swrc:school><swrc:University swrc:name="Universität Düsseldorf"/></swrc:school><swrc:title>Folksonomies in Wissensrepräsentation und Information Retrieval</swrc:title><swrc:type>PhD thesis</swrc:type><swrc:year>2009</swrc:year><swrc:keywords>folksonomy information knowledge representation retrieval wissen </swrc:keywords><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Isabella Peters"/></rdf:_1></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/216fa5c5fc155e296ff885bf62d1a1230/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/216fa5c5fc155e296ff885bf62d1a1230/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><owl:sameAs rdf:resource="http://www.phil-fak.uni-duesseldorf.de/infowiss/mitarbeiter/wissenschaftliche-mitarbeiter-hilfskraefte/isabella-peters/012-folksonomies-in-wissensrepraesentation-und-information-retrieval/"/><swrc:date>Wed Jan 26 11:41:50 CET 2011</swrc:date><swrc:journal>Information - Wissenschaft und Praxis</swrc:journal><swrc:number>2</swrc:number><swrc:pages>77--90</swrc:pages><swrc:title>Folksonomies in Wissensrepräsentation und Information Retrieval</swrc:title><swrc:volume>59</swrc:volume><swrc:year>2008</swrc:year><swrc:keywords>folksonomy information representation retrieval wissen </swrc:keywords><swrc:abstract>Folksonomies in Wissensrepräsentation und Information Retrieval.
Die populären Web 2.0-Dienste werden von Prosumern – Produzenten und gleichsam Konsumenten – nicht nur dazu genutzt, Inhalte zu produzieren, sondern auch, um sie inhaltlich zu erschließen. Folksonomies erlauben es dem Nutzer, Dokumente mit eigenen Schlagworten, sog. Tags, zu beschreiben, ohne dabei auf gewisse Regeln oder Vorgaben achten zu müssen. Neben einigen Vorteilen zeigen Folksonomies aber auch zahlreiche Schwächen (u. a. einen Mangel an Präzision). Um diesen Nachteilen größtenteils entgegenzuwirken, schlagen wir eine Interpretation der Tags als natürlichsprachige Wörter vor. Dadurch ist es uns möglich, Methoden des Natural Language Processing (NLP) auf die Tags anzuwenden und so linguistische Probleme der Tags zu beseitigen. Darüber hinaus diskutieren wir Ansätze und weitere Vorschläge (Tagverteilungen, Kollaboration und akteurspezifische Aspekte) hinsichtlich eines Relevance Rankings von getaggten Dokumenten. Neben Vorschlägen auf ähnliche Dokumente („more like this!“) erlauben Folksonomies auch Hinweise auf verwandte Nutzer und damit auf Communities („more like me!“).

Folksonomies in Knowledge Representation and Information Retrieval
In Web 2.0 services “prosumers” – producers and consumers – collaborate not only for the purpose of creating content, but to index these pieces of information as well. Folksonomies permit actors to describe documents with subject headings, “tags“, without regarding any rules. Apart from a lot of benefits folksonomies have many shortcomings (e.g., lack of precision). In order to solve some of the problems we propose interpreting tags as natural language terms. Accordingly, we can introduce methods of NLP to solve the tags’ linguistic problems. Additionally, we present criteria for tagged documents to create a ranking by relevance (tag distribution, collaboration and actor-based aspects). Besides recommending similar documents („more like this!“) folksonomies can be used for the recommendation of similar users and communities („more like me!“).</swrc:abstract><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Isabella Peters"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Wolfgang G. Stock"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/29c028ebcb336380cb02e2a4beaa14d54/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/29c028ebcb336380cb02e2a4beaa14d54/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Book"/><owl:sameAs rdf:resource="http://www.amazon.com/Visual-Display-Quantitative-Information-2nd/dp/0961392142%3FSubscriptionId%3D192BW6DQ43CK9FN0ZGG2%26tag%3Dws%26linkCode%3Dxm2%26camp%3D2025%26creative%3D165953%26creativeASIN%3D0961392142"/><swrc:date>Tue Nov 17 11:19:31 CET 2009</swrc:date><swrc:edition>Second</swrc:edition><swrc:publisher><swrc:Organization swrc:name="Graphics Press"/></swrc:publisher><swrc:title>The Visual Display of Quantitative Information</swrc:title><swrc:year>2001</swrc:year><swrc:keywords>data information mining toread visualization </swrc:keywords><swrc:hasExtraField><swrc:Field swrc:value="9780961392147" swrc:key="ean"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="0961392142" swrc:key="asin"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="0961392142" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="001.4226" swrc:key="dewey"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Edward R. Tufte"/></rdf:_1></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/212176d90012ed75f57996af0b9240d02/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/212176d90012ed75f57996af0b9240d02/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><owl:sameAs rdf:resource="http://portal.acm.org/citation.cfm?id=582418"/><swrc:date>Tue Nov 03 15:41:30 CET 2009</swrc:date><swrc:address>New York, NY, USA</swrc:address><swrc:journal>ACM Transactions on Information Systems</swrc:journal><swrc:month>oct</swrc:month><swrc:number>4</swrc:number><swrc:pages>422--446</swrc:pages><swrc:publisher><swrc:Organization swrc:name="ACM"/></swrc:publisher><swrc:title>Cumulated gain-based evaluation of IR techniques</swrc:title><swrc:volume>20</swrc:volume><swrc:year>2002</swrc:year><swrc:keywords>evaluation information ir retrieval </swrc:keywords><swrc:abstract>Modern large retrieval environments tend to overwhelm their users by their large output. Since all documents are not of equal relevance to their users, highly relevant documents should be identified and ranked first for presentation. In order to develop IR techniques in this direction, it is necessary to develop evaluation approaches and methods that credit IR methods for their ability to retrieve highly relevant documents. This can be done by extending traditional evaluation methods, that is, recall and precision based on binary relevance judgments, to graded relevance judgments. Alternatively, novel measures based on graded relevance judgments may be developed. This article proposes several novel measures that compute the cumulative gain the user obtains by examining the retrieval result up to a given ranked position. The first one accumulates the relevance scores of retrieved documents along the ranked result list. The second one is similar but applies a discount factor to the relevance scores in order to devaluate late-retrieved documents. The third one computes the relative-to-the-ideal performance of IR techniques, based on the cumulative gain they are able to yield. These novel measures are defined and discussed and their use is demonstrated in a case study using TREC data: sample system run results for 20 queries in TREC-7. As a relevance base we used novel graded relevance judgments on a four-point scale. The test results indicate that the proposed measures credit IR methods for their ability to retrieve highly relevant documents and allow testing of statistical significance of effectiveness differences. The graphs based on the measures also provide insight into the performance IR techniques and allow interpretation, for example, from the user point of view.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="1046-8188" swrc:key="issn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1145/582415.582418" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Kalervo Järvelin"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Jaana Kekäläinen"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/212592d5f805db5bd127ee5abae1a4325/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/212592d5f805db5bd127ee5abae1a4325/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://portal.acm.org/citation.cfm?id=345545"/><swrc:date>Tue Nov 03 13:28:34 CET 2009</swrc:date><swrc:address>New York, NY, USA</swrc:address><swrc:booktitle>SIGIR &#039;00: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</swrc:booktitle><swrc:pages>41--48</swrc:pages><swrc:publisher><swrc:Organization swrc:name="ACM"/></swrc:publisher><swrc:title>IR evaluation methods for retrieving highly relevant documents</swrc:title><swrc:year>2000</swrc:year><swrc:keywords>evaluation information ir retrieval </swrc:keywords><swrc:abstract>This paper proposes evaluation methods based on the use of non-dichotomous relevance judgements in IR experiments. It is argued that evaluation methods should credit IR methods for their ability to retrieve highly relevant documents. This is desirable from the user point of view in modern large IR environments. The proposed methods are (1) a novel application of P-R curves and average precision computations based on separate recall bases for documents of different degrees of relevance, and (2) two novel measures computing the cumulative gain the user obtains by examining the retrieval result up to a given ranked position. We then demonstrate the use of these evaluation methods in a case study on the effectiveness of query types, based on combinations of query structures and expansion, in retrieving documents of various degrees of relevance. The test was run with a best match retrieval system (In-Query1) in a text database consisting of newspaper articles. The results indicate that the tested strong query structures are most effective in retrieving highly relevant documents. The differences between the query types are practically essential and statistically significant. More generally, the novel evaluation methods and the case demonstrate that non-dichotomous relevance assessments are applicable in IR experiments, may reveal interesting phenomena, and allow harder testing of IR methods.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="Athens, Greece" swrc:key="location"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="1-58113-226-3" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1145/345508.345545" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Kalervo Järvelin"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Jaana Kekäläinen"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2ead0b4af17c94074fe1c774d2f267617/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2ead0b4af17c94074fe1c774d2f267617/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Book"/><owl:sameAs rdf:resource="http://portal.acm.org/citation.cfm?id=553876"/><swrc:date>Sat Oct 17 10:08:47 CEST 2009</swrc:date><swrc:address>Boston, MA, USA</swrc:address><swrc:publisher><swrc:Organization swrc:name="Addison-Wesley Longman Publishing Co., Inc."/></swrc:publisher><swrc:title>Modern Information Retrieval</swrc:title><swrc:year>1999</swrc:year><swrc:keywords>book information ir retrieval web </swrc:keywords><swrc:abstract>This is a rigorous and complete textbook for a first course on information retrieval from the computer science (as opposed to a user-centred) perspective. The advent of the Internet and the enormous increase in volume of electronically stored information generally has led to substantial work on IR from the computer science perspective - this book provides an up-to-date student oriented treatment of the subject.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="020139829X" swrc:key="isbn"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Ricardo A. Baeza-Yates"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Berthier Ribeiro-Neto"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2dfa880e6d3e33d0aeb357396fb1833cd/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2dfa880e6d3e33d0aeb357396fb1833cd/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://portal.acm.org/citation.cfm?id=1379110&amp;coll=GUIDE&amp;dl=GUIDE&amp;CFID=37458772&amp;CFTOKEN=13998061&amp;ret=1"/><swrc:date>Mon Aug 03 10:04:08 CEST 2009</swrc:date><swrc:address>New York, NY, USA</swrc:address><swrc:booktitle>HT &#039;08: Proceedings of the nineteenth ACM conference on Hypertext and hypermedia</swrc:booktitle><swrc:pages>81--88</swrc:pages><swrc:publisher><swrc:Organization swrc:name="ACM"/></swrc:publisher><swrc:title>Understanding the efficiency of social tagging systems using information theory</swrc:title><swrc:year>2008</swrc:year><swrc:keywords>bookmarking efficiency folksonomy information social tagging theory </swrc:keywords><swrc:abstract>Given the rise in popularity of social tagging systems, it seems only natural to ask how efficient is the organically evolved tagging vocabulary in describing underlying document objects? Does this distributed process really provide a way to circumnavigate the traditional &#034;vocabulary problem&#034; with ontology? We analyze a social tagging site, namely del.icio.us, with information theory in order to evaluate the efficiency of this social tagging site for encoding navigation paths to information sources. We show that information theory provides a natural and interesting way to understand this efficiency - or the descriptive, encoding power of tags. Our results indicate the efficiency of tags appears to be waning. We discuss the implications of our findings and provide insight into how our methods can be used to design more usable social tagging software.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="Pittsburgh, PA, USA" swrc:key="location"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="978-1-59593-985-2" swrc:key="isbn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1145/1379092.1379110" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Ed H. Chi"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Todd Mytkowicz"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/297d3915ead822cfa033fc821b424e437/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/297d3915ead822cfa033fc821b424e437/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Article"/><owl:sameAs rdf:resource="http://www.emeraldinsight.com/10.1108/00242530610667558"/><swrc:date>Mon Oct 27 09:14:27 CET 2008</swrc:date><swrc:journal>Library Review</swrc:journal><swrc:number>5</swrc:number><swrc:pages>291-300</swrc:pages><swrc:publisher><swrc:Organization swrc:name="Emerald Group Publishing Limited"/></swrc:publisher><swrc:title>Collaborative Tagging as a Knowledge Organisation and Resource Discovery Tool</swrc:title><swrc:volume>55</swrc:volume><swrc:year>2006</swrc:year><swrc:keywords>bookmarking classification folksonomy information knowledge management retrieval social tagging </swrc:keywords><swrc:abstract>The purpose of the paper is to provide an overview of the collaborative tagging phenomenon and explore some of the reasons for its emergence. Design/methodology/approach - The paper reviews the related literature and discusses some of the problems associated with, and the potential of, collaborative tagging approaches for knowledge organisation and general resource discovery. A definition of controlled vocabularies is proposed and used to assess the efficacy of collaborative tagging. An exposition of the collaborative tagging model is provided and a review of the major contributions to the tagging literature is presented. Findings - There are numerous difficulties with collaborative tagging systems (e.g. low precision, lack of collocation, etc.) that originate from the absence of properties that characterise controlled vocabularies. However, such systems can not be dismissed. Librarians and information professionals have lessons to learn from the interactive and social aspects exemplified by collaborative tagging systems, as well as their success in engaging users with information management. The future co-existence of controlled vocabularies and collaborative tagging is predicted, with each appropriate for use within distinct information contexts: formal and informal. Research limitations/implications - Librarians and information professional researchers should be playing a leading role in research aimed at assessing the efficacy of collaborative tagging in relation to information storage, organisation, and retrieval, and to influence the future development of collaborative tagging systems. Practical implications - The paper indicates clear areas where digital libraries and repositories could innovate in order to better engage users with information. Originality/value - At time of writing there were no literature reviews summarising the main contributions to the collaborative tagging research or debate.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="0024-2535" swrc:key="issn"/></swrc:hasExtraField><swrc:hasExtraField><swrc:Field swrc:value="10.1108/00242530610667558" swrc:key="doi"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="George Macgregor"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Emma McCulloch"/></rdf:_2></rdf:Seq></swrc:author><swrc:editor><rdf:Seq><rdf:_1><swrc:Person swrc:name="David McMenemy"/></rdf:_1></rdf:Seq></swrc:editor></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/23c301945817681d637ee43901c016939/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/23c301945817681d637ee43901c016939/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><swrc:date>Thu Feb 01 14:04:37 CET 2007</swrc:date><swrc:address>Heidelberg</swrc:address><swrc:booktitle>The Semantic Web: Research and Applications</swrc:booktitle><swrc:month>June</swrc:month><swrc:pages>411-426</swrc:pages><swrc:publisher><swrc:Organization swrc:name="Springer"/></swrc:publisher><swrc:series>Lecture Notes in Computer Science</swrc:series><swrc:title>Information Retrieval in Folksonomies: Search and Ranking</swrc:title><swrc:volume>4011</swrc:volume><swrc:year>2006</swrc:year><swrc:keywords>2006 folkrank folksonomy graph iccs_example information l3s mining myown ol_tut2010 rank ranking retrieval search seminar2006 trias_example webzu pagerank </swrc:keywords><swrc:abstract>Social bookmark tools are rapidly emerging on the Web. In such systems users are setting up lightweight conceptual structures called folksonomies. The reason for their immediate success is the fact that no specific skills are needed for participating. At the moment, however, the information retrieval support is limited. We present a formal model and a new search algorithm for folksonomies,
called FolkRank, that exploits the structure of the folksonomy. The proposed algorithm is also applied to find
communities within the folksonomy and is used to structure search results. All findings are demonstrated on a large scale dataset.</swrc:abstract><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Andreas Hotho"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Robert Jäschke"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Christoph Schmitz"/></rdf:_3><rdf:_4><swrc:Person swrc:name="Gerd Stumme"/></rdf:_4></rdf:Seq></swrc:author><swrc:editor><rdf:Seq><rdf:_1><swrc:Person swrc:name="York Sure"/></rdf:_1><rdf:_2><swrc:Person swrc:name="John Domingue"/></rdf:_2></rdf:Seq></swrc:editor></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/273b4dff8c6fac17b3ea377ed5b162540/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/273b4dff8c6fac17b3ea377ed5b162540/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><swrc:date>Wed May 10 16:00:48 CEST 2006</swrc:date><swrc:address>Washington, DC, USA</swrc:address><swrc:booktitle>JCDL &#039;03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries</swrc:booktitle><swrc:pages>49--60</swrc:pages><swrc:publisher><swrc:Organization swrc:name="IEEE Computer Society"/></swrc:publisher><swrc:title>Bibliographic attribute extraction from erroneous references based on a statistical model</swrc:title><swrc:year>2003</swrc:year><swrc:keywords>extraction information ie bibliographic </swrc:keywords><swrc:abstract>In this paper, we propose a method for extracting bibliographic attributes from reference strings captured using Optical Character Recognition (OCR) and an extended hidden Markov model. Bibliographic attribute extraction can be used in two ways. One is reference parsing in which attribute values are extracted from OCR-processed references for bibliographic matching. The other is reference alignment in which attribute values are aligned to the bibliographic record to enrich the vocabulary of the bibliographic database. In this paper, we first propose a statistical model for attribute extraction that represents both the syntactical structure of references and OCR error patterns. Then, we perform experiments using bibliographic references obtained from scanned images of papers in journals and transactions and show that useful attribute values are extracted from OCR-processed references. We also show that the proposed model has advantages in reducing the cost of preparing training data, a critical problem in rule-based systems.</swrc:abstract><swrc:hasExtraField><swrc:Field swrc:value="0-7695-1939-3" swrc:key="isbn"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Atsuhiro Takasu"/></rdf:_1></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/28d04bc19e470fe4b98e15a27a1e6e7e9/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/28d04bc19e470fe4b98e15a27a1e6e7e9/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://dblp.uni-trier.de/db/conf/naacl/naacl2004.html#PengM04"/><swrc:date>Wed Apr 19 15:29:08 CEST 2006</swrc:date><swrc:booktitle>HLT-NAACL</swrc:booktitle><swrc:pages>329-336</swrc:pages><swrc:title>Accurate Information Extraction from Research Papers using Conditional Random Fields</swrc:title><swrc:year>2004</swrc:year><swrc:keywords>extraction information </swrc:keywords><swrc:hasExtraField><swrc:Field swrc:value="http://acl.ldc.upenn.edu/hlt-naacl2004/main/pdf/176_Paper.pdf" swrc:key="ee"/></swrc:hasExtraField><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Fuchun Peng"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Andrew McCallum"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/26dbb7b45a3a53997359a5e3c2677dc52/jaeschke"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/26dbb7b45a3a53997359a5e3c2677dc52/jaeschke"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Unpublished"/><owl:sameAs rdf:resource="http://mallet.cs.umass.edu"/><swrc:date>Tue Apr 11 14:59:28 CEST 2006</swrc:date><swrc:note>http://mallet.cs.umass.edu</swrc:note><swrc:title>{MALLET: A Machine Learning for Language Toolkit}</swrc:title><swrc:year>2002</swrc:year><swrc:keywords>extraction learning information mallet machine </swrc:keywords><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Andrew Kachites McCallum"/></rdf:_1></rdf:Seq></swrc:author></rdf:Description></rdf:RDF>
