FullText.exe is freely available for academic usage. The program generates a word-occurrence matrix, a co-occurrence matrix, and a normalized co-occurrence matrix from a set of text files and a word list.
eTBLAST is a unique search engine for searching biomedical literature. Our service is very different from PubMed. While PubMed searches for "keywords", our search engine lets you input an entire paragraph and returns MEDLINE abstracts that are similar to
provides a common platform for discussing extensions of the MediaWiki software that allow for simple, machine-based processing of Wiki content. This usually requires some form of "semantic annotation," a single solution for semantic annotation that fits t
Powerful Search Engine designed for Document Management, Competitive Intelligence, Press Analysis and Text Mining, Web Mining, Knowledge Discovery, Strategic Watch...Has Report Writer, Web Spider, Publisher, more...
"Swoogle is a search engine for the Semantic Web on the Web. Swoogle crawl the World Wide Web for a special class of web documents called Semantic Web documents, which are written in RDF."
Information is stuck inside HTML pages, formatted in esoteric ways, difficult for machines to process. "Web 3.0", precursor to a refined semantic web, will change this. ‘Web 3.0′ will transform web sites into web services. Unstructured information bec
Discussion of flaws and potential solutions to using social bookmarking sites (from del.icio.us to digg) as folks monetize, abuse, trick, and tweak them: no uniform tagging conventions, flat tag structures (non-relational), overly-generalized tags (catego
With this Web page, we are opening some aspects of hakia R&D to the view of our users. We undertook highly specific research tasks solely dedicated to the advancement of the core-competency in Web search. The main challenge is to make science work in a co
Taxonomy of Markup · I use a taxonomy of markup which I'm pretty sure was first advanced in the seminal November 1987 CACM article Markup systems and the future of scholarly text processing, by Coombs, Renear, and DeRose, which was the first place I ever
Report from the 13th ACM Conference on Information and Knowledge Management, Washington DC, USA, 2004. Sponsors: ACM Special Interest Group on Information Retrieval, ACM Association for Computing Machinery
RxNorm, a standardized nomenclature for clinical drugs, is produced by the National Library of Medicine (NLM). In this context, a clinical drug is a pharmaceutical product given to (or taken by) a patient with a therapeutic or diagnostic intent. In RxNorm
Semantic similarity, also called semantic relatedness or semantic closeness/proximity/nearness, is a concept whereby a set of documents or terms within term lists are assigned a metric based on the likeness of their meaning / semantic content.
With due respect, I think you, and even more aggresiously Clay Shirkey, have been misrepresenting what the Semantic Web is, and critiquing based on that misunderstanding, not on the reality. Folks like Danny Hillis and Nova Spivack who were listening got
Just as colonies of social insects such as ants and bees are able to perform intelligent collective behaviors without centralized control, the millions, or even billions, of humans and programs roaming independently through the Semantic Web, selectively r
Building a centralized database to process billions of open-ended queries per day is a mammoth undertaking. It appears that Google, who perhaps is the only company on the planet with enough imagination, incentive, and expertise to effectively build such a
Metaweb holds the promise of all connected humankind weaving a tapestry of connections that more and more of us will be able to stand back and say, "Hmmm....I see a pattern here" and thus be able to invent ever higher value, solve deeper and more profound
If you search Google Base, you will find many records describing this camera, all loaded by different people. Data in one record may duplicate the data in another record. Or even worse, data in one may disagree with the others and no attempt has been mad
Category search within digital repositories is poorly supported. This means that people wishing to access the assets of digital repositories are largely limited to keyword search, which means they must know what they want in order to look for it. Our part
Automatic semantic annotation of information content is an open problem, but is crucial to the realization of the Semantic Web. Annotation systems require the initial definition of an ontology and as well as a knowledge base. Both of these resources work
Topics include: Merging Results from Independent SPARQL Queries, Home from SWAP2006, DBin for Power Users to Create Discussion Groups, Use of URIs for Naming, Another Cool Thing About GRDDL, XQuery and RDF, Dark Side of the Semantic Web, CSS in RDF+, QOT
We believe that the enterprise ontology will become a cornerstone in many information systems in the future. In general terms, an ontology is an organization of a body of knowledge or, at least, an organization of a set of terms related to a body of know
I will discuss how RSS and its taxonomy module can be used as a central format to carry metadata collected in a classical news format, such as XMLNews-Story, to RDF or relational databases and XML Topic Maps. Readers should have basic familiarity with RSS
From legacy relational databases to the semantic web, the China Academy of Traditional Chinese Medicine (CATCM), where over 70 legacy relational databases are semantically interconnected by an ontology with over 70 classes and 800 properties, providing in
In this article, I will discuss how RSS 1.0 and its taxonomy module can be used as a central format to carry metadata collected in a classical news format, such as XMLNews-Story, to RDF or relational databases and XML Topic Maps. Readers should have basic
The Role of Semantic Web in Web 2.0: Partner or Follower? Currently, the web phenomenon that is driving the best developers and captivating the best entrepreneurs is Web 2.0. Web 2.0 encompasses some of today's most exciting web-based applications: mas
Data on the Semantic Web is semi-structured and does not follow one fixed schema. Faceted browsing is a natural technique for navigating such data, partitioning the information space into orthogonal conceptual dimensions. Current faceted interfaces are ma
Abstract from downloadable PDF: Data on the Semantic Web is semi-structured and does not follow one fixed schema. Faceted browsing is a natural technique for navigating such data, partitioning the information space into orthogonal conceptual dimensions.
The Semantic Web is expected to provide more benefits to software engineering. Over the past five years there have been a number of attempts to bring together languages and tools, such as the UML, developed for Software Engineering with Semantic Web langu
[Sample clip]: For two systems to communicate they must commit to a common ontology. It doesn't matter how elegant or clever your ontology is, if no one else shares it, you don't participate in anything broader than your own ontology...
Welcome to the official 4th European Semantic Web Conference site. ESWC 2007 will take place from 3-7th, June 2007 in the Tyrol region of Innsbruck, Austria. This year's event will host a variety of workshops, tutorials, demonstrations and posters ded
This diagram depicts a spectrum of information sharing capabilities. Moving from lower right to upper left of the diagram, we see that more expressive forms of metadata and semantic modeling encompass the simpler forms, and extend their capabilities. From
Semantic technologies have become central to a broad range of research and development initiatives. This diagram visualizes the intersections of four major development themes in the semantic wave: networking (e.g., semantic web, grid & p2p), content (e.g.
Videos, photos, semantic tools online, semantic web challenge apps, industry talks, keynote talks, announcements, news, abstracts, sponsors, speakers, and more reportage from the ISWC.2006.
Business management, government management, and military defense management...This illustration charts uses of the predicted "semantic web" by enterprise and government.
The challenge that the Yahoos and Microsofts of the world have is that they are still beholden to the older corporate model of the world, and tend to denigrate their user generated content as being so much fluff. Thus when the Web 2.0 explosion occurred,
The goal of the SWS Challenge is to develop a common understanding of various technologies intended to facilitate the automation of mediation, choreography and discovery for Web Services using semantic annotations. The intent of this challenge is to explo
We have a limited “Semantic Web” appearing without the complex technologies that have been developed for it. Will the trend continue? Can it, using existing technologies, or will developers eventually ‘find’ RDF, OWL and SPARQL? Should the appeara
The Semantic Web, where machines are able to read the contents of documents as readily as people can, now has all the standards and technologies it needs to succeed, according to W3C director Tim Berners-Lee. Speaking at the World Wide Web 2006 conferen
Downloadable articles and audio files from the 2006 Semantic Technology Conference held in San Jose, CA. Material is geared toward enterprise and business use of semantic web technologies, although not limited to that topic.
Not long after he invented and unleashed the World Wide Web, Tim Berners-Lee realized that the limit to the effectiveness of the World Wide Web would be that while billions of documents could be linked and indexed, they relied on human interpretation to d
The 3rd Annual European Semantic Web Conference will be held in Budva, Montenegro from the 11th - 14th June, 2006. It will present the latest results in research and application in Semantic Web technologies (including knowledge markup languages, Semantic
The Semantic Technology Conference (SemTech) enters its third year as the pre-eminent meeting place for the growing community of developers, entrepreneurs, technology architects and researchers who are building software and systems based on semantic techn
The Semantic Web is a project to create a universal medium for information exchange by putting documents with computer-processable meaning (semantics) on the World Wide Web.
XFN™ (XHTML Friends Network) is a simple way to represent human relationships using hyperlinks. In recent years, blogs and blogrolls have become the fastest growing area of the Web. XFN enables web authors to indicate their relationship(s) to the people
Microformats are (officially) a set of simple, open data formats built upon existing and widely adopted standards that are designed for humans first and machines second. ...Microformats are about using the standards we all know and love to convey as much
So, what are you waiting for?The network effect tells us that the value of a technology increases the more it is used. Microformats are rapidly experiencing the benefits of this effect. Innovative publishers are publishing microformats, while innovative
Microformats are small and gentle syntactic touchups for your web pages.They have one major purpose: to make your data readable by both man and machine...The machine-readable-data (and thus the microformat) concept is not new; it has a very recent fo
A. Ankolekar, M. Krötzsch, T. Tran, and D. Vrandecic. WWW '07: Proceedings of the 16th international conference on World Wide Web, page 825--834. New York, NY, USA, ACM Press, (2007)