With this Web page, we are opening some aspects of hakia R&D to the view of our users. We undertook highly specific research tasks solely dedicated to the advancement of the core-competency in Web search. The main challenge is to make science work in a co
Query log data for ad targeting
A WWW2006 paper out of Microsoft Research, "Finding Advertising Keywords on Web Pages" (PDF), claims that query log data is particularly useful for ad targeting.
Specifically, the researchers extracted from MSN query logs the keywords some people used to find a given page. They tested using that as one of many features for ad targeting. In their results, it was one of the most effective features.
Very interesting. It has always been harder to target ads to content than to search results because intent is much less clear.
By using the query log data in this way, the researchers were effectively using the intent of the searchers that arrived at the page as a proxy for the intent of everyone who arrived at the page.
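As a rough illustration of the idea, the sketch below groups query terms by the page the searcher landed on, so that each page gets a bag of "arrival keywords" that could serve as one targeting feature. The log format, the function name `query_keywords_by_page`, and the sample data are assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter, defaultdict


def query_keywords_by_page(log_entries):
    """Map each page URL to a Counter of query terms that led to it.

    log_entries is an iterable of (query, clicked_url) pairs; this
    simplified log format is an assumption, not the MSN log schema.
    """
    keywords = defaultdict(Counter)
    for query, url in log_entries:
        for term in query.lower().split():
            keywords[url][term] += 1
    return keywords


# Hypothetical (query, clicked URL) pairs; not data from the paper.
sample_log = [
    ("cheap digital camera", "http://example.com/cameras"),
    ("digital camera reviews", "http://example.com/cameras"),
    ("hiking boots", "http://example.com/boots"),
]

page_keywords = query_keywords_by_page(sample_log)
top_terms = page_keywords["http://example.com/cameras"].most_common(2)
```

In a real system these counts would be only one signal among many, as the paper's use of "one of many features" suggests.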
This is the Watson Web interface for searching ontologies and semantic documents using keywords. This interface is subject to frequent changes and improvements. If you want to share your opinion, suggest improvements or comment on the results, don't hesitate to contact us... At the moment, you can enter a set of keywords (e.g. "cat dog old_lady") and obtain a list of URIs of semantic documents in which the keywords appear as identifiers or in literals of classes, properties, and individuals. You can also use wildcards ("jokers") in the keywords (e.g., "ca? dog*"). Navigation in the results follows very simple principles. First, whenever a sign appears, it can be used to display additional information about the element it is attached to. Second, every URI is clickable. A URI is a link to a page describing either the entity or the semantic document it corresponds to, and gives access to additional functionalities using this particular entity or document.
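The keyword matching described above can be sketched as follows. This is a minimal sketch under stated assumptions, not Watson's actual implementation: it assumes that "?" stands for exactly one character and "*" for any run of characters, that matching is case-insensitive, and that a document must match every keyword. The function `matching_documents` and the sample URIs are hypothetical.

```python
from fnmatch import fnmatchcase


def matching_documents(keywords, documents):
    """Return URIs of documents whose identifiers match every keyword.

    keywords  -- list of patterns, where '?' and '*' act as wildcards
    documents -- dict mapping a document URI to its list of identifiers
    """
    hits = []
    for uri, identifiers in documents.items():
        lowered = [ident.lower() for ident in identifiers]
        if all(
            any(fnmatchcase(ident, kw.lower()) for ident in lowered)
            for kw in keywords
        ):
            hits.append(uri)
    return hits


# Hypothetical index of semantic documents and the identifiers they contain.
documents = {
    "http://example.org/pets.owl": ["Cat", "Dog", "old_lady"],
    "http://example.org/vehicles.rdf": ["Car", "Truck"],
    "http://example.org/animals.rdf": ["Cat", "Dogfish"],
}

result = matching_documents("ca? dog*".split(), documents)
```

Here the vehicles document is excluded because, although "Car" matches "ca?", nothing in it matches "dog*".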
ArcGIS Online is a unified Web portal designed by the Environmental Systems Research Institute (Esri). It contains a rich collection of Web maps, layers, and services contributed by GIS users throughout the world. The metadata about these GIS resources reside in data silos that can be accessed via a Web API. While this is sufficient for simple syntax-based searches, it does not support more advanced queries, e.g., finding maps based on the semantics of the search terms, or performing customized queries that are not pre-designed in the API. In metadata, titles and descriptions are commonly available attributes which provide important information about the content of the GIS resources. Ho
From legacy relational databases to the semantic web, the China Academy of Traditional Chinese Medicine (CATCM), where over 70 legacy relational databases are semantically interconnected by an ontology with over 70 classes and 800 properties, providing in
Microformats are small and gentle syntactic touchups for your web pages. They have one major purpose: to make your data readable by both man and machine... The machine-readable-data (and thus the microformat) concept is not new; it has a very recent fo
Powerful Search Engine designed for Document Management, Competitive Intelligence, Press Analysis and Text Mining, Web Mining, Knowledge Discovery, Strategic Watch...Has Report Writer, Web Spider, Publisher, more...
Aperture is a Java framework for extracting and querying full-text content and metadata from various information systems (e.g. file systems, web sites, mail boxes) and the file formats (e.g. documents, images) occurring in these systems.
The Open Text Mining Interface (OTMI) is an initiative from Nature Publishing Group (NPG). It aims to enable scholarly publishers, among others, to disclose their full text for indexing and text-mining purposes but without giving it away in a form that is
ThManager is an Open Source Tool for creating and visualizing SKOS RDF vocabularies, a W3C initiative for the representation of knowledge organization systems such as thesauri, classification schemes, subject heading lists, taxonomies, and other types of controlled vocabulary. ThManager facilitates the management of thesauri and other types of controlled vocabularies, such as taxonomies or classification schemes. The tool has been implemented in Java and has the following features:
Multi-platform (Windows, Unix). As it has been developed in Java and the storage of metadata records is managed directly through the file system, the application can be deployed on any platform, the only requirement being an installed Java virtual machine.
Multilingual. The application has been developed following the Java internationalization methodology. Currently, there are Spanish and English versions. With little effort, other languages could be supported.
Selection and filtering of the thesauri stored in the local repository.
Description of thesauri by means of metadata in compliance with a Dublin Core based application profile for thesauri (see application profile). These metadata can be either visualized in HTML or edited through a form.
Visualization of thesaurus concepts. The visualization interface includes the following widgets:
Alphabetic viewer: It provides the list of thesaurus concepts alphabetically ordered in the selected language.
Hierarchical viewer: It provides a tree showing the hierarchical structure of thesaurus concepts.
Concept viewer: For a selected concept it shows all the properties allowing additionally the navigation to the related concepts by means of hyperlinks.
Search tool: It facilitates search of concepts. The searching process is based on preferred labels allowing the following criteria: "equals", "starts with" and "contains".
Editing of thesaurus content. The tool provides an editing interface to modify the content of a thesaurus: creation of concepts, deletion of concepts, and update of concept properties.
Exchange of thesauri according to SKOS format. The export operation includes the export of thesaurus metadata.
Extraction of related concepts in WordNet. It generates an automatic mapping of thesaurus concepts against the concepts of the WordNet lexical database.
On-line help by means of PDF visualization.
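The three search criteria listed for the search tool ("equals", "starts with" and "contains") can be illustrated with a short sketch. This is not ThManager's code (the tool itself is written in Java); the function `search_concepts`, the case-insensitive comparison, and the sample labels are assumptions made for illustration.

```python
def search_concepts(pref_labels, text, criterion="contains"):
    """Return the preferred labels that satisfy the given criterion.

    criterion is one of "equals", "starts with" or "contains".
    Comparison is case-insensitive, which is an assumption.
    """
    text = text.lower()
    tests = {
        "equals": lambda label: label == text,
        "starts with": lambda label: label.startswith(text),
        "contains": lambda label: text in label,
    }
    matches = tests[criterion]
    return [label for label in pref_labels if matches(label.lower())]


# Hypothetical preferred labels of a small thesaurus.
labels = ["Water", "Water pollution", "Groundwater"]

exact = search_concepts(labels, "water", "equals")
prefix = search_concepts(labels, "water", "starts with")
anywhere = search_concepts(labels, "water", "contains")
```

Each criterion is strictly broader than the previous one, so `exact` is a subset of `prefix`, which in turn is a subset of `anywhere`.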
PoolParty Thesaurus Manager
Meets high expectations on usability
Provides customisable metadata schemas
Strictly built on open W3C standards
PoolParty Extractor
Highly performant text mining algorithms
Addresses different data sources
Delivers relevant context information
PoolParty Search
High end refinement assistants
Search different sources with one API
Ready for third party integration
eTBLAST is a unique search engine for searching biomedical literature. Our service is very different from PubMed. While PubMed searches for "keywords", our search engine lets you input an entire paragraph and returns MEDLINE abstracts that are similar to
ConceptNet is a freely available commonsense knowledgebase and natural-language-processing toolkit which supports many practical textual-reasoning tasks over real-world documents right out-of-the-box.
SITE INFORMATION ARCHITECTURE OPTIMIZATION
This project will design an information architecture analysis tool targeted towards webmasters and content providers. The tool will provide site and page popularity information, and will analyze and optimize site structures, themes, meta tags, comments, keywords and related attributes.
IBM OmniFind Personal E-mail Search: A powerful semantic search engine that enables you to search your e-mail easily and effectively; plug-ins are available for Microsoft Outlook and Lotus Notes mail systems.
The Microsoft team found that groups formed on the basis of demographic attributes such as age or origin have little in common for most search queries. For groups of people with similar interests the picture is quite different: here, search results are often rated similarly. The team also found that although users believed they phrased their search queries similarly, overall they chose very different words.
When asked about the advantages and disadvantages of telework, for example, one test subject searched for the English word for it: "telecommuting". Others instead used "cost benefits of working from home" or "cost-effectiveness comparison telecommuting versus office". If the search engine knew that all three had the same interests, that would make better results possible, says Teevan. "I talk about a particular thing differently than the people around me. If we make use of these different ways, it becomes easier for us to find pages on which someone expresses things in their own way."
04 June, 2006: Just passing on some great news: Technorati has introduced Microformats Search. It even has specific search engine entries: hcard for Contacts, hcalendar for Events and hreview for Reviews. Note, if you publish microformats in your blog, just keep on
"Swoogle is a search engine for the Semantic Web on the Web. Swoogle crawls the World Wide Web for a special class of web documents called Semantic Web documents, which are written in RDF."
D. Schlör, J. Pfister, and A. Hotho. 2023 the 7th International Conference on Medical and Health Informatics (ICMHI), pp. 136–141. New York, NY, USA, Association for Computing Machinery, (2023)
C. Xiong, R. Power, and J. Callan. Proceedings of the 26th International Conference on World Wide Web, pp. 1271–1279. Republic and Canton of Geneva, Switzerland, International World Wide Web Conferences Steering Committee, (2017)