FullText.exe is freely available for academic usage. The program generates a word-occurrence matrix, a co-occurrence matrix, and a normalized co-occurrence matrix from a set of text files and a word list.
eTBLAST is a unique search engine for searching biomedical literature. Our service is very different from PubMed. While PubMed searches for "keywords", our search engine lets you input an entire paragraph and returns MEDLINE abstracts that are similar to
provides a common platform for discussing extensions of the MediaWiki software that allow for simple, machine-based processing of Wiki content. This usually requires some form of "semantic annotation," a single solution for semantic annotation that fits t
Powerful Search Engine designed for Document Management, Competitive Intelligence, Press Analysis and Text Mining, Web Mining, Knowledge Discovery, Strategic Watch...Has Report Writer, Web Spider, Publisher, more...
"Swoogle is a search engine for the Semantic Web on the Web. Swoogle crawl the World Wide Web for a special class of web documents called Semantic Web documents, which are written in RDF."
Information is stuck inside HTML pages, formatted in esoteric ways, difficult for machines to process. "Web 3.0", precursor to a refined semantic web, will change this. ‘Web 3.0′ will transform web sites into web services. Unstructured information bec
Discussion of flaws and potential solutions to using social bookmarking sites (from del.icio.us to digg) as folks monetize, abuse, trick, and tweak them: no uniform tagging conventions, flat tag structures (non-relational), overly-generalized tags (catego
With this Web page, we are opening some aspects of hakia R&D to the view of our users. We undertook highly specific research tasks solely dedicated to the advancement of the core-competency in Web search. The main challenge is to make science work in a co
Taxonomy of Markup · I use a taxonomy of markup which I'm pretty sure was first advanced in the seminal November 1987 CACM article Markup systems and the future of scholarly text processing, by Coombs, Renear, and DeRose, which was the first place I ever
Report from the 13th ACM Conference on Information and Knowledge Management, Washington DC, USA, 2004. Sponsors: ACM Special Interest Group on Information Retrieval, ACM Association for Computing Machinery
RxNorm, a standardized nomenclature for clinical drugs, is produced by the National Library of Medicine (NLM). In this context, a clinical drug is a pharmaceutical product given to (or taken by) a patient with a therapeutic or diagnostic intent. In RxNorm
Semantic similarity, also called semantic relatedness or semantic closeness/proximity/nearness, is a concept whereby a set of documents or terms within term lists are assigned a metric based on the likeness of their meaning / semantic content.
With due respect, I think you, and even more aggresiously Clay Shirkey, have been misrepresenting what the Semantic Web is, and critiquing based on that misunderstanding, not on the reality. Folks like Danny Hillis and Nova Spivack who were listening got
Just as colonies of social insects such as ants and bees are able to perform intelligent collective behaviors without centralized control, the millions, or even billions, of humans and programs roaming independently through the Semantic Web, selectively r
Building a centralized database to process billions of open-ended queries per day is a mammoth undertaking. It appears that Google, who perhaps is the only company on the planet with enough imagination, incentive, and expertise to effectively build such a
Metaweb holds the promise of all connected humankind weaving a tapestry of connections that more and more of us will be able to stand back and say, "Hmmm....I see a pattern here" and thus be able to invent ever higher value, solve deeper and more profound
If you search Google Base, you will find many records describing this camera, all loaded by different people. Data in one record may duplicate the data in another record. Or even worse, data in one may disagree with the others and no attempt has been mad
Category search within digital repositories is poorly supported. This means that people wishing to access the assets of digital repositories are largely limited to keyword search, which means they must know what they want in order to look for it. Our part
Automatic semantic annotation of information content is an open problem, but is crucial to the realization of the Semantic Web. Annotation systems require the initial definition of an ontology and as well as a knowledge base. Both of these resources work
Topics include: Merging Results from Independent SPARQL Queries, Home from SWAP2006, DBin for Power Users to Create Discussion Groups, Use of URIs for Naming, Another Cool Thing About GRDDL, XQuery and RDF, Dark Side of the Semantic Web, CSS in RDF+, QOT
We believe that the enterprise ontology will become a cornerstone in many information systems in the future. In general terms, an ontology is an organization of a body of knowledge or, at least, an organization of a set of terms related to a body of know
I will discuss how RSS and its taxonomy module can be used as a central format to carry metadata collected in a classical news format, such as XMLNews-Story, to RDF or relational databases and XML Topic Maps. Readers should have basic familiarity with RSS
From legacy relational databases to the semantic web, the China Academy of Traditional Chinese Medicine (CATCM), where over 70 legacy relational databases are semantically interconnected by an ontology with over 70 classes and 800 properties, providing in
In this article, I will discuss how RSS 1.0 and its taxonomy module can be used as a central format to carry metadata collected in a classical news format, such as XMLNews-Story, to RDF or relational databases and XML Topic Maps. Readers should have basic
The Role of Semantic Web in Web 2.0: Partner or Follower? Currently, the web phenomenon that is driving the best developers and captivating the best entrepreneurs is Web 2.0. Web 2.0 encompasses some of today's most exciting web-based applications: mas
Data on the Semantic Web is semi-structured and does not follow one fixed schema. Faceted browsing is a natural technique for navigating such data, partitioning the information space into orthogonal conceptual dimensions. Current faceted interfaces are ma
Abstract from downloadable PDF: Data on the Semantic Web is semi-structured and does not follow one fixed schema. Faceted browsing is a natural technique for navigating such data, partitioning the information space into orthogonal conceptual dimensions.
The Semantic Web is expected to provide more benefits to software engineering. Over the past five years there have been a number of attempts to bring together languages and tools, such as the UML, developed for Software Engineering with Semantic Web langu
[Sample clip]: For two systems to communicate they must commit to a common ontology. It doesn't matter how elegant or clever your ontology is, if no one else shares it, you don't participate in anything broader than your own ontology...
Welcome to the official 4th European Semantic Web Conference site. ESWC 2007 will take place from 3-7th, June 2007 in the Tyrol region of Innsbruck, Austria. This year's event will host a variety of workshops, tutorials, demonstrations and posters ded
This diagram depicts a spectrum of information sharing capabilities. Moving from lower right to upper left of the diagram, we see that more expressive forms of metadata and semantic modeling encompass the simpler forms, and extend their capabilities. From
Semantic technologies have become central to a broad range of research and development initiatives. This diagram visualizes the intersections of four major development themes in the semantic wave: networking (e.g., semantic web, grid & p2p), content (e.g.
A. Ankolekar, M. Krötzsch, T. Tran, and D. Vrandecic. WWW '07: Proceedings of the 16th international conference on World Wide Web, page 825--834. New York, NY, USA, ACM Press, (2007)