When former U.S. Senator Rick Santorum started gearing up to launch his presidential campaign earlier this year, there was one question he could not avoid. It had to do with the matter of alt-weekly editor and advice columnist Dan Savage, who has for years positioned himself as Santorum’s most prominent critic. Many politicians have fierce opponents, but few did what Savage did in 2003, and that was hold a contest to give an alternate meaning to the word “santorum”. I hope you’ll forgive me for declining to quote the winning definition, but you can find it here, and suffice to say that it has stuck. So much so, in fact, that eight years later Savage’s term has come to dominate the web search results for Rick Santorum’s name.
A reconciliation service is a web service that, given some text which is a name or label for something, and optionally some additional details, returns a ranked list of potential entities matching the criteria. The candidate text does not have to match each entity's official name perfectly, and that's the whole point of reconciliation--to get from ambiguous text name to precisely identified entities. For instance, given the text "apple", a reconciliation service probably should return the Apple Inc. company, the apple fruit, and New York city (also known as the Big Apple).
Pearson Longman English Language Teaching (Pearson Longman ELT) is a leading educational publisher of quality resources for all ages and abilities across the curriculum, providing solutions for teachers and students.
Wednesday, August 08, 2007
Using Wikipedia to disambiguate names
Silviu Cucerzan at Microsoft Research recently published a paper, "Large-Scale Named Entity Disambiguation Based on Wikipedia Data"
There are now many computer programs for automatically determining the sense of a word in context (Word Sense Disambiguation or WSD). The purpose of Senseval is to evaluate the strengths and weaknesses of such programs with respect to different words, dif
The proposal Algorithms for Linguistic Processing focuses on two crucial problem areas in computational linguistics: problems of processing efficiency and ambiguity.
This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and
This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and
Lexical ambiguity is a fundamental problem in Information Retrieval (IR), especially in the medical domain. Many systems use a subset of the words contained in the document to represent the content, but they are faced with the problem of ambiguity.
Lexical ambiguity is a fundamental problem in Information Retrieval (IR), especially in the medical domain. Many systems use a subset of the words contained in the document to represent the content, but they are faced with the problem of ambiguity.
Collection of Ambiguous Statements. Helps Clarify the Enormity of the Semantic Web Project, in Which Machines Must Make Sense of Humanababble, or Folksononsense...
Collection of Ambiguous Statements. Helps Clarify the Enormity of the Semantic Web Project, in Which Machines Must Make Sense of Humanababble, or Folksononsense...
Lexical ambiguity arises when context is insufficient to determine the sense of a single word that has more than one meaning. Syntactic ambiguity arises when a sentence can be parsed in more than one way. Semantic ambiguity arises when a word or concept
Lexical ambiguity arises when context is insufficient to determine the sense of a single word that has more than one meaning. Syntactic ambiguity arises when a sentence can be parsed in more than one way. Semantic ambiguity arises when a word or concept
In general, a namespace is an abstract container providing context for the items (names, or technical terms, or words) it holds and allows disambiguation of items having the same name...As a rule, names in a namespace cannot have more than one meaning, th
In general, a namespace is an abstract container providing context for the items (names, or technical terms, or words) it holds and allows disambiguation of items having the same name...As a rule, names in a namespace cannot have more than one meaning, th
A meme ID is like a magic bullet in the document saying "I am about this meme." Can we construct a general taxonomy of memes, and then specialized lists of meme IDs that authors will feel comfortable adding to their documents?
A meme ID is like a magic bullet in the document saying "I am about this meme." Can we construct a general taxonomy of memes, and then specialized lists of meme IDs that authors will feel comfortable adding to their documents?
A memespace has a unique alphanumeric identifier to disambiguate it from other memespaces. The present design for meme IDs is: MEMESPACE-TAXOSPACE-ID. Essentially, it's another controlled vocabulary...
A memespace has a unique alphanumeric identifier to disambiguate it from other memespaces. The present design for meme IDs is: MEMESPACE-TAXOSPACE-ID. Essentially, it's another controlled vocabulary...
L. Wiechetek, K. Unhammer, и S. Moshagen. Workshop on the Use of Computational Methods in the Study of Endangered Languages, 1, стр. 46. ComputEL, (2019)
S. Liu, C. Yu, и W. Meng. Proceedings of the 14th ACM International Conference on Information and Knowledge Management, стр. 525--532. New York, NY, USA, ACM, (2005)
A. Garcia-Silva, M. Szomszor, H. Alani, и O. Corcho. The Fifth International Conference on Knowledge Capture (K-Cap'09) - First International Workshop on Collective Knowledge Capturing and Representation (CKCaR'09), Informatica, (2009)
J. Reisinger, и R. Mooney. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, стр. 109--117. Association for Computational Linguistics, (2010)
L. Ratinov, D. Roth, D. Downey, и M. Anderson. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, стр. 1375--1384. Stroudsburg, PA, USA, Association for Computational Linguistics, (2011)