Identity is fundamental to ontology, and especially to information systems ontologies. Identity is well known in metaphysics and in database conceptual modeling. In the latter case, it is an accepted best practice to specify a primary key for rows in a ta
If everyone would create good metadata for the purposes of describing their goods, services and information, it would be a trivial matter to search the Internet for highly qualified, context-sensitive results: a fan could find all the downloadable music i
It is important to differentiate between text data mining and information access (or information retrieval, as it is more widely known)... the goal of data mining is to discover or derive new information from data, finding patterns across datasets, and/o
Ever notice that before you can really do a good web search, you have to actually know something about your search topic? Let's say you want to learn more about jazz. But you don't know any of the "keywords," the musicians, the composers, the talent. S
Tagging is great...But can tagging be better? Yes. For example, how do you specify Paris (the city) as opposed to Paris (the person)? By using contextual tags (e.g.; celebrity:paris or city:paris) to give "thing" tags meaning. words are no longer stripp
This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and
Lexical ambiguity is a fundamental problem in Information Retrieval (IR), especially in the medical domain. Many systems use a subset of the words contained in the document to represent the content, but they are faced with the problem of ambiguity.
You maintain a blog at, say, livejournal.com (but this can be anything) and you stay logged in there usually. You go to leave a comment at someblog.com (perhaps it's Movable Type, or Wordpress, or DeadJournal, ...) and you don't have an account there, s