Webbots, Spiders, and Screen Scrapers is "unmatched to my knowledge in how it covers PHP/CURL. It explains to great details on how to write web clients using PHP/CURL, what pitfalls there are, how to make your code behave well and much more."
OpenAcoon is a search engine available as open source. The software has been used for years by the Acoon search engine, which also continues to develop it.
OpenAcoon is written in Pascal and currently runs only on Windows. However, we are already working on adapting the sources for FreePascal so that the software runs on both Windows and Linux.
What the world needs now is not another metasearch engine. Mind you, having more and better and even free metasearch engines is a good thing, but there are already many metasearch engines, each with different strengths and weaknesses, and even some that are free and open source (e.g., see Oregon State’s LibraryFind). Metasearch isn’t an effective solution for the problem at hand.
In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn't find and index for users who search on Google. Specifically, when we encounter a <FORM> element on a high-quality site, we might choose to do a small number of queries using the form. For text boxes, our computers automatically choose words from the site that has the form; for select menus, check boxes, and radio buttons on the form, we choose from among the values of the HTML. Having chosen the values for each input, we generate and then try to crawl URLs that correspond to a possible query a user may have made. If we ascertain that the web page resulting from our query is valid, interesting, and includes content not in our index, we may include it in our index much as we would include any other web page.
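The URL-generation step described above can be illustrated with a minimal Python sketch. It is not Google's implementation. The function name `candidate_form_urls`, the `limit` parameter, and the example site and field values are all hypothetical, and the sketch assumes a form submitted via GET whose action URL carries no query string of its own.

```python
from itertools import islice, product
from urllib.parse import urlencode, urljoin


def candidate_form_urls(page_url, action, fields, limit=10):
    """Return up to `limit` GET URLs, one per combination of input values.

    `fields` maps each input name to the candidate values chosen for it
    (words taken from the site for text boxes, the listed options for
    select menus, check boxes, and radio buttons).
    Assumption: the form uses method GET and `action` has no query string.
    """
    names = sorted(fields)                 # fixed order keeps URLs reproducible
    base = urljoin(page_url, action)       # resolve a relative form action
    combos = product(*(fields[name] for name in names))
    # Only a small number of queries is issued, so cut the product short.
    return [
        f"{base}?{urlencode(dict(zip(names, combo)))}"
        for combo in islice(combos, limit)
    ]


if __name__ == "__main__":
    urls = candidate_form_urls(
        "http://example.com/search/",      # hypothetical page holding the form
        "results",                         # hypothetical form action
        {"q": ["library", "catalog"], "type": ["book", "article"]},
        limit=3,
    )
    for url in urls:
        print(url)
```

A crawler would then fetch each generated URL and keep only the result pages that prove valid and contain content not already in the index. That filtering step is left out here.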
DiscoverLibrary is a new search and discovery tool for the library’s vast collection of resources. A simple search box will bring back results from a number of different sources, including Acorn, the library’s catalog, and the Vanderbilt TV News Archive. Additionally, many of the library’s online article databases are searchable through DiscoverLibrary on the second tab.
This is the initial release of DiscoverLibrary, and its development is an ongoing process. Over time we will add new resources and features, as well as refine the user interface. Please help us make the new service better by leaving your suggestions and comments in the box to the right.
The Scopus Application Program Interface (API) enables you to search the largest abstract and citation database of peer-reviewed literature and quality web sources.
You can select Scopus data elements and create your own mashups.
The API returns Scopus data in a format that is easily integrated into an application or your web site.
Yahoo’s embrace of all things open continues today - expect an announcement in an hour or so that they are expanding their Open Search Platform that we wrote about last month.
True to the motto "Raider is now Twix", the video search engine yovisto (formerly "Osotis") is presenting itself in a new guise at this year's CeBIT 2008. What is new, apart from a new name and a new layout? Quite a lot!
Nothing is more practical than a good theory. A banal statement, considering that a theory should always enable its users to easily derive the statements they need for practice.
But a theory for catalogs or cataloging? Is that really necessary? A question anyone is likely to ask who has never been confronted with the matter nor considered it with any seriousness.
Using Internet search engines, and knowing their operation is fully automated, people tend to view with skepticism all practical and theoretical effort invested in catalogs. Any good search engine, however, has to be based on a good theory - though that theory may differ quite a bit from a catalog theory.