Today, to search is to google. Specifically, it is to use Google’s search engine to find something on the Web. As for those other searches that once helped define the human condition (for meaning, love, purpose, or God), those have, in little more than a decade, assumed almost secondary importance. From its now almost apocryphal beginnings…
Whichever web page you visit, use this tool to search over linked content. Just click the "G"-with-chain-link icon in your status bar to open or close the toolbar (or ctrl-alt-s). Great for searching from tables-of-contents, bookmarks, or blogs!
The term “Semantic Search” is certainly not new. However, it has taken on a new dimension and implications in both search and social engines today. In addition, it has had a strong impact on targeted semantic advertising.
ThManager is an Open Source Tool for creating and visualizing SKOS RDF vocabularies, a W3C initiative for the representation of knowledge organization systems such as thesauri, classification schemes, subject heading lists, taxonomies, and other types of controlled vocabulary. ThManager facilitates the management of thesauri and other types of controlled vocabularies, such as taxonomies or classification schemes. The tool has been implemented in Java and has the following features:
Multi-platform (Windows, Unix). Because it is developed in Java and the storage of metadata records is managed directly through the file system, the application can be deployed on any platform that has a Java virtual machine installed.
Multilingual. The application has been developed following the Java internationalization methodology. Currently, there are Spanish and English versions. With little effort, other languages could be supported.
Selection and filtering of the thesauri stored in the local repository.
Description of thesauri by means of metadata in compliance with a Dublin Core based application profile for thesauri (see application profile). These metadata can be either visualized in HTML or edited through a form.
Visualization of thesaurus concepts. The visualization interface includes the following widgets:
Alphabetic viewer: It provides the list of thesaurus concepts alphabetically ordered in the selected language.
Hierarchical viewer: It provides a tree showing the hierarchical structure of thesaurus concepts.
Concept viewer: For a selected concept, it shows all the properties and additionally allows navigation to the related concepts by means of hyperlinks.
Search tool: It facilitates search of concepts. The searching process is based on preferred labels allowing the following criteria: "equals", "starts with" and "contains".
Editing of thesaurus content. The tool provides an editing interface to modify the content of a thesaurus: creation of concepts, deletion of concepts, and update of concept properties.
Exchange of thesauri according to SKOS format. The export operation includes the export of thesaurus metadata.
Extraction of related concepts in WordNet. It generates an automatic mapping of thesaurus concepts against the concepts of the WordNet lexical database.
On-line help by means of PDF visualization.
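The three search criteria listed for ThManager's search tool ("equals", "starts with", "contains") can be illustrated with a minimal Python sketch. This is not ThManager's code (the tool itself is written in Java); the function name `search_concepts`, the `CRITERIA` table, and the case-insensitive comparison are assumptions made only for this example.

```python
from typing import Callable, Dict, List

# Hypothetical criteria table: the keys mirror the three criteria named in the
# feature list; the matching functions are an assumption, not ThManager code.
CRITERIA: Dict[str, Callable[[str, str], bool]] = {
    "equals": lambda label, query: label == query,
    "starts with": lambda label, query: label.startswith(query),
    "contains": lambda label, query: query in label,
}


def search_concepts(pref_labels: List[str], query: str, criterion: str) -> List[str]:
    """Return the preferred labels that match `query`, alphabetically ordered.

    Matching is case-insensitive here, which is an assumption of this sketch.
    """
    match = CRITERIA[criterion]
    q = query.casefold()
    return sorted(label for label in pref_labels if match(label.casefold(), q))


if __name__ == "__main__":
    labels = ["Water", "Topsoil", "Soil erosion", "Soil"]
    for criterion in CRITERIA:
        print(criterion, search_concepts(labels, "soil", criterion))
```

Keeping the criteria in a lookup table means a further criterion (for example "ends with") could be added without touching the search function.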
Everything search engine
Locate files and folders by name instantly.
Small installation file
Clean and simple user interface
Quick file indexing
Quick searching
Minimal resource usage
Share files with others easily
Real-time updating
More...
InSight Desktop Search
Easily Search For Files/Folders across HDD.
Support for shared Network Places.
Support for Metadata.
Dedicated Music Search and Playback.
Search Outlook Emails and Contacts.
Search for articles on Wikipedia.
Quick Launch Shortcuts.
Small index size and live updates.
InSight Preview.
Quick disk indexing speed: 1-2 min.
Puggle is an open-source desktop search engine written exclusively in Java. It provides full text and metadata search over files, folders, music, photos, web pages and more that are stored locally on your computer.
DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia, and to link other data sets on the Web to Wikipedia data. We hope this will make it easier for the amazing amount of information in Wikipedia to be used in new and interesting ways, and that it might inspire new mechanisms for navigating, linking and improving the encyclopaedia itself.
Apache Tika is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. You can find the latest release on the download page. See the Getting Started guide for instructions on how to start using Tika.
Katta is a scalable, failure-tolerant, distributed data store for real-time access.
Katta serves large, replicated indices as shards to handle high loads and very large data sets. These indices can be of different types. Currently, implementations are available for Lucene and Hadoop MapFiles.
* Makes serving large or high load indices easy
* Serves very large Lucene or Hadoop Mapfile indices as index shards on many servers
* Replicates shards on different servers for performance and fault-tolerance
* Supports pluggable network topologies
* Master fail-over
* Fast, lightweight, easy to integrate
* Plays well with Hadoop clusters
* Apache Version 2 License
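The idea of replicating index shards across different servers can be sketched in a few lines of Python. This is only an illustration of one simple placement scheme (round-robin); it is not Katta's actual distribution policy, and the function name `assign_shards` and the shard and node names are hypothetical.

```python
from typing import Dict, List


def assign_shards(shards: List[str], nodes: List[str], replicas: int) -> Dict[str, List[str]]:
    """Place each shard on `replicas` distinct nodes using round-robin.

    Assumption of this sketch: simple round-robin placement, chosen for
    clarity. A real system would also account for load and node failures.
    """
    if replicas < 1 or replicas > len(nodes):
        raise ValueError("replicas must be between 1 and the number of nodes")
    placement: Dict[str, List[str]] = {}
    for i, shard in enumerate(shards):
        # Consecutive offsets modulo the node count give distinct nodes,
        # because replicas never exceeds len(nodes).
        placement[shard] = [nodes[(i + r) % len(nodes)] for r in range(replicas)]
    return placement


if __name__ == "__main__":
    plan = assign_shards(["shard0", "shard1", "shard2"], ["nodeA", "nodeB", "nodeC"], 2)
    for shard, hosts in plan.items():
        print(shard, hosts)
```

With two replicas per shard, losing any single node in this example still leaves one copy of every shard available, which is the fault-tolerance property the feature list refers to.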
New Feedly combines Google Reader, friendfeed, Twitter in great way for social network addicts http://ff.im/15mmv (via @infomaniac) [from http://twitter.com/jomiralb/statuses/1216843945]
New Feedly combines Google Reader, friendfeed, Twitter in great way for social network addicts http://ff.im/15vAq (via @iggykin) [from http://twitter.com/jomiralb/statuses/1216756614]
Sphinx is a full-text search engine, distributed under GPL version 2. A commercial license is also available for embedded use.
Generally, it's a standalone search engine, meant to provide fast, size-efficient and relevant full-text search functions to other applications. Sphinx was specially designed to integrate well with SQL databases and scripting languages. Currently, built-in data sources support fetching data either via a direct connection to MySQL or PostgreSQL, or using an XML pipe mechanism (a pipe to the indexer in a special XML-based format which Sphinx recognizes).
As for the name, Sphinx is an acronym which is officially decoded as SQL Phrase Index. Yes, I know about CMU's Sphinx project.
Ever notice that before you can really do a good web search, you have to actually know something about your search topic? Let's say you want to learn more about jazz. But you don't know any of the "keywords," the musicians, the composers, the talent. S
A meme ID is like a magic bullet in the document saying "I am about this meme." Can we construct a general taxonomy of memes, and then specialized lists of meme IDs that authors will feel comfortable adding to their documents?
R. Baeza-Yates, L. Calderón-Benavides, and C. González-Caro. Proceedings of String Processing and Information Retrieval (SPIRE), volume 4209 of Lecture Notes in Computer Science, pp. 98--109. Springer, (2006)
X. Wang and C. Zhai. Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2007, (2007)