dbenz > attended | BibSonomy

Lesezeichen (verstecken)26
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

3Evaluating Similarity Measures for Emergent Semantics of Social Tagging - WWW2009 EPrints
Social bookmarking systems and their emergent information structures, known as folksonomies, are increasingly important data sources for Semantic Web applications. A key question for harvesting semantics from these systems is how to extend and adapt traditional notions of similarity to folksonomies, and which measures are best suited for applications such as navigation support, semantic search, and ontology learning. Here we build an evaluation framework to compare various general folksonomy-based similarity measures derived from established information-theoretic, statistical, and practical measures. Our framework deals generally and symmetrically with users, tags, and resources. For evaluation purposes we focus on similarity among tags and resources, considering different ways to aggregate annotations across users. After comparing how tag similarity measures predict user-created tag relations, we provide an external grounding by user-validated semantic proxies based on WordNet and the Open Directory. We also investigate the issue of scalability. We ﬁnd that mutual information with distributional micro-aggregation across users yields the highest accuracy, but is not scalable; per-user projection with collaborative aggregation provides the best scalable approach via incremental computations. The results are consistent across resource and tag similarity.
vor 15 Jahren von @dbenz
alle anzeigen
attended
itegpub
myown
www2009
attendeditegpubmyownwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1WWW2009: Social Web Camp
http://www2009.eprints.org/programreg.html#W3Csocial
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Releasing Search Queries and Clicks Privately - WWW2009 EPrints
http://www2009.eprints.org/18/
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Mining the Web 2.0 for Better Search - WWW2009 EPrints
There are several semantic sources that can be found in the Web that are either explicit, e.g. Wikipedia, or implicit, e.g. derived from Web usage data. Most of them are related to user generated content (UGC) or what is called today the Web 2.0. In this talk we show several applications of mining the wisdom of crowds behind UGC to improve search. We will show live demos to find relations in the Wikipedia or to improve image search as well as our current research in the topic. Our final goal is to produce a virtuous data feedback circuit to leverage the Web itself.
vor 15 Jahren von @dbenz
alle anzeigen
attended
baeza_yates
keynote
mining
web2.0
www2009
attendedbaeza_yateskeynoteminingweb2.0www2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Tag Ranking - WWW2009 EPrints
Social media sharing web sites like Flickr allow users to annotate images with free tags, which signiﬁcantly facilitate Web image search and organization. However, the tags associated with an image generally are in a random order without any importance or relevance information, which limits the effectiveness of these tags in search and other applications. In this paper, we propose a tag ranking scheme, aiming to automatically rank the tags associated with a given image according to their relevance to the image content. We ﬁrst estimate initial relevance scores for the tags based on probability density estimation, and then perform a random walk over a tag similarity graph to reﬁne the relevance scores. Experimental results on a 50, 000 Flickr photo collection show that the proposed tag ranking method is both effective and efficient. We also apply tag ranking into three applications: (1) tag-based image search, (2) tag recommendation, and (3) group recommendation, which demonstrates that the proposed tag ranking approach really boosts the performances of social-tagging related applications.
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Learning to Tag - WWW2009 EPrints
Social tagging provides valuable and crucial information for large-scale web image retrieval. It is ontology-free and easy to obtain; however, irrelevant tags frequently appear, and users typically will not tag all semantic objects in the image, which is also called semantic loss. To avoid noises and compensate for the semantic loss, tag recommendation is proposed in literature. However, current recommendation simply ranks the related tags based on the single modality of tag co-occurrence on the whole dataset, which ignores other modalities, such as visual correlation. This paper proposes a multi-modality recommendation based on both tag and visual correlation, and formulates the tag recommendation as a learning problem. Each modality is used to generate a ranking feature, and Rankboost algorithm is applied to learn an optimal combination of these ranking features from different modalities. Experiments on Flickr data demonstrate the effectiveness of this learning-based multi-modality recommendation strategy.
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Visual Diversification of Image Search Results - WWW2009 EPrints
Due to the reliance on the textual information associated with an image, image search engines on the Web lack the discriminative power to deliver visually diverse search results. The textual descriptions are key to retrieve relevant results for a given user query, but at the same time provide little information about the rich image content. In this paper we investigate three methods for visual diversiﬁcation of image search results. The methods deploy lightweight clustering techniques in combination with a dynamic weighting function of the visual features, to best capture the discriminative aspects of the resulting set of images that is retrieved. A representative image is selected from each cluster, which together form a diverse result set. Based on a performance evaluation we ﬁnd that the outcome of the methods closely resembles human perception of diversity, which was established in an extensive clustering experiment carried out by human assessors. models deployed on the Web and by these photo sharing sites rely heavily on search paradigms developed within the ﬁeld Information Retrieval. This way, image retrieval can beneﬁt from years of research experience, and the better this textual metadata captures the content of the image, the better the retrieval performance will be. It is also commonly acknowledged that a picture has to be seen to fully understand its meaning, signiﬁcance, beauty, or context, simply because it conveys information that words can not capture, or at least not in any practical setting. This explains the large number of papers on content-based image retrieval (CBIR) that has been published since 1990, the breathtaking publication rates since 1997 [12], and the continuing interest in the ﬁeld [4]. Moving on from simple low-level features to more discriminative descriptions, the ﬁeld has come a long way in narrowing down the semantic gap by using high-level semantics [8]. Unfortunately, CBIR-methods using higher level semantics usually require extensive training, intricate object ontologies or expensive construction of a visual dictionary, and their performance remains unﬁt for use in large scale online applications such as the aforementioned search engines or websites. Consequently, retrieval models operating in the textual metadata domain are therefore deployed here. In these applications, image search results are usually displayed in a ranked list. This ranking reﬂects the similarity of the image’s metadata to the textual query, according to the textual retrieval model of choice. There may exist two problems with this ranking. First, it may be lacking visual diversity. For instance, when a speciﬁc type or brand of car is issued as query, it may very well be that the top of this ranking displays many times the same picture that was released by the marketing division of the company. Similarly, pictures of a popular holiday destination tend to show the same touristic hot spot, often taken from the same angle and distance. This absence of visual diversity is due to the nature of the image annotation, which does not allow or motivate people to adequately describe the visual content of an image. Second, the query may have several aspects to it that are not sufficiently covered by the ranking. Perhaps the user is interested in a particular aspect of the query, but doesn’t know how to express this explicitly and issues a broader, more general query. It could also be that a query yields so many different results, that it’s hard to get an overview of the collection of relevant images in the database.
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1What Makes Conversations Interesting? Themes, Participants and Consequences of Conversations in Online Social Media - WWW2009 EPrints
Rich media social networks promote not only creation and consumption of media, but also communication about the posted media item. What causes a conversation to be interesting, that prompts a user to participate in the discussion on a posted video? We conjecture that people participate in conversations when they find the conversation theme interesting, see comments by people whom they are familiar with, or observe an engaging dialogue between two or more people (absorbing back and forth exchange of comments). Importantly, a conversation that is interesting must be consequential – i.e. it must impact the social network itself. Our framework has three parts. First, we detect conversational themes using a mixture model approach. Second, we determine interestingness of participants and interestingness of conversations based on a random walk model. Third, we measure the consequence of a conversation by measuring how interestingness affects the following three variables – participation in related themes, participant cohesiveness and theme diffusion. We have conducted extensive experiments using a dataset from the popular video sharing site, YouTube. Our results show that our method of interestingness maximizes the mutual information, and is significantly better (twice as large) than three other baseline methods (number of comments, number of new participants and PageRank based assessment). create (e.g. upload photo on Flickr), and consume media (e.g. watch a video on YouTube). These websites also allow for significant communication between the users – such as comments by one user on a media uploaded by another. These comments reveal a rich dialogue structure (user A comments on the upload, user B comments on the upload, A comments in response to B’s comment, B responds to A’s comment etc.) between users, where the discussion is often about themes unrelated to the original video. Example of a conversation from YouTube [1] is shown in Figure 1. In this paper, the sequence of comments on a media object is referred to as a conversation. Note the theme of the conversation is latent and depends on the content of the conversation. The fundamental idea explored in this paper is that analysis of communication activity is crucial to understanding repeated visits to a rich media social networking site. People return to a video post that they have already seen and post further comments (say in YouTube) in response to the communication activity, rather than to watch the video again. Thus it is the content of the communication activity itself that the people want to read (or see, if the response to a video post is another video, as is possible in the case of YouTube). Furthermore, these rich media sites have notification mechanisms that alert users of new comments on a video post / image upload promoting this communication activity.
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Mining Interesting Locations and Travel Sequences from GPS Trajectories
The increasing availability of GPS-enabled devices is changing the way people interact with the Web, and brings us a large amount of GPS trajectories representing people’s location histories. In this paper, based on multiple users’ GPS trajectories, we aim to mine interesting locations and classical travel sequences in a given geospatial region. Here, interesting locations mean the culturally important places, such as Tiananmen Square in Beijing, and frequented public areas, like shopping malls and restaurants, etc. Such information can help users understand surrounding locations, and would enable travel recommendation. In this work, we first model multiple individuals’ location histories with a tree-based hierarchical graph (TBHG). Second, based on the TBHG, we propose a HITS (Hypertext Induced Topic Search)-based inference model, which regards an individual’s access on a location as a directed link from the user to that location. This model infers the interest of a location by taking into account the following three factors. 1) The interest of a location depends on not only the number of users visiting this location but also these users’ travel experiences. 2) Users’ travel experiences and location interests have a mutual reinforcement relationship. 3) The interest of a location and the travel experience of a user are relative values and are region-related. Third, we mine the classical travel sequences among locations considering the interests of these locations and users’ travel experiences. We evaluated our system using a large GPS dataset collected by 107 users over a period of one year in the real world. As a result, our HITS-based inference model outperformed baseline approaches like rank-by-count and rank-by-frequency. Meanwhile, when considering the users’ travel experiences and location interests, we achieved a better performance beyond baselines, such as rank-by-count and rank-by-interest, etc.
vor 15 Jahren von @dbenz
alle anzeigen
attended
gps
mining
trajectory
www2009
attendedgpsminingtrajectorywww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Learning to Recognize Reliable Users and Content in Social Media with Coupled Mutual Reinforcement
Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yahoo! Answers have exploded in popularity, and now provide a viable alternative to general purpose Web search. At the same time, the answers to past questions submitted in CQA sites comprise a valuable knowledge repository which could be a gold mine for information retrieval and automatic question answering. Unfortunately, the quality of the submitted questions and answers varies widely - increasingly so that a large fraction of the content is not usable for answering queries. Previous approaches for retrieving relevant and high quality content have been proposed, but they require large amounts of manually labeled data – which limits the applicability of the supervised approaches to new sites and domains. In this paper we address this problem by developing a semi-supervised coupled mutual reinforcement framework for simultaneously calculating content quality and user reputation, that requires relatively few labeled examples to initialize the training process. Results of a large scale evaluation demonstrate that our methods are more effective than previous approaches for ﬁnding high-quality answers, questions, and users. More importantly, our quality estimation signiﬁcantly improves the accuracy of search over CQA archives over the state-of-the-art methods.
vor 15 Jahren von @dbenz
alle anzeigen
attended
www2009
attendedwww2009
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags

BibSonomy

Lesezeichen (verstecken)26
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

3Evaluating Similarity Measures for Emergent Semantics of Social Tagging - WWW2009 EPrints

1WWW2009: Social Web Camp

1Releasing Search Queries and Clicks Privately - WWW2009 EPrints

1Mining the Web 2.0 for Better Search - WWW2009 EPrints

1Tag Ranking - WWW2009 EPrints

1Learning to Tag - WWW2009 EPrints

1Visual Diversification of Image Search Results - WWW2009 EPrints

1What Makes Conversations Interesting? Themes, Participants and Consequences of Conversations in Online Social Media - WWW2009 EPrints

1Mining Interesting Locations and Travel Sequences from GPS Trajectories

1Learning to Recognize Reliable Users and Content in Social Media with Coupled Mutual Reinforcement

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Stöbern

Verwandte Tags

Konzepte

Tags

Lesezeichen (verstecken)26 Anzeigeallesnur LesezeichenLesezeichen pro Seite5102050100 sortieren nachhinzugefügt amTitel RSSBibTeXXML

Publikationen (verstecken) Anzeigeallesnur PublikationenPublikationen pro Seite5102050100 sortieren nachhinzugefügt amTitelAutorErscheinungsdatumEintragstypHilfe für erweiterte Sortierung... RSSBibTeXRDFmehr...

Stöbern

Verwandte Tags

Tags

Lesezeichen (verstecken)26
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...