This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and
Lexical ambiguity is a fundamental problem in Information Retrieval (IR), especially in the medical domain. Many systems use a subset of the words contained in the document to represent the content, but they are faced with the problem of ambiguity.
Collection of Ambiguous Statements. Helps Clarify the Enormity of the Semantic Web Project, in Which Machines Must Make Sense of Humanababble, or Folksononsense...
Lexical ambiguity arises when context is insufficient to determine the sense of a single word that has more than one meaning. Syntactic ambiguity arises when a sentence can be parsed in more than one way. Semantic ambiguity arises when a word or concept
In general, a namespace is an abstract container providing context for the items (names, or technical terms, or words) it holds and allowing disambiguation of items having the same name... As a rule, names in a namespace cannot have more than one meaning, th
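The namespace idea can be illustrated with a minimal Python sketch. The namespace names ("medicine", "weather"), the sample word, and the helper functions are hypothetical and chosen only for illustration; the excerpt itself specifies none of them.

```python
# Hypothetical namespaces: within each one, a name has exactly one meaning.
NAMESPACES = {
    "medicine": {"cold": "a viral infection of the upper respiratory tract"},
    "weather": {"cold": "a low ambient temperature"},
}


def candidates(name: str) -> list[str]:
    """Return every namespace that defines the bare (unqualified) name."""
    return sorted(ns for ns, items in NAMESPACES.items() if name in items)


def resolve(namespace: str, name: str) -> str:
    """Look up a name inside one namespace; the (namespace, name) pair is unambiguous."""
    return NAMESPACES[namespace][name]
```

Under these assumptions the bare name "cold" is ambiguous because candidates("cold") finds it in two namespaces, while resolve("weather", "cold") returns a single meaning.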
A meme ID is like a magic bullet in the document saying "I am about this meme." Can we construct a general taxonomy of memes, and then specialized lists of meme IDs that authors will feel comfortable adding to their documents?
A memespace has a unique alphanumeric identifier to disambiguate it from other memespaces. The present design for meme IDs is: MEMESPACE-TAXOSPACE-ID. Essentially, it's another controlled vocabulary...
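A rough sketch of how an identifier in the MEMESPACE-TAXOSPACE-ID form might be parsed and rebuilt is given below. It assumes that the three parts are joined by hyphens and that each part is purely alphanumeric; the excerpt only states that the memespace identifier is alphanumeric, so the rule for the other two parts, the MemeID type, and both function names are illustrative guesses rather than part of the described design.

```python
import re
from typing import NamedTuple


class MemeID(NamedTuple):
    """Hypothetical container for the three parts of a meme ID."""
    memespace: str
    taxospace: str
    local_id: str


# Assumption: every part is one or more ASCII letters or digits.
_MEME_ID = re.compile(r"([A-Za-z0-9]+)-([A-Za-z0-9]+)-([A-Za-z0-9]+)")


def parse_meme_id(text: str) -> MemeID:
    """Split a MEMESPACE-TAXOSPACE-ID string into its three parts."""
    match = _MEME_ID.fullmatch(text)
    if match is None:
        raise ValueError(f"not a MEMESPACE-TAXOSPACE-ID string: {text!r}")
    return MemeID(*match.groups())


def format_meme_id(meme_id: MemeID) -> str:
    """Join the three parts back into a single identifier string."""
    return "-".join(meme_id)
```

Keeping the memespace as the leading component means two IDs with the same taxospace and local ID still differ whenever they come from different memespaces.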