bookmarks
- Research Interests Comparator (RIC) is our fourth electronic text mining project. The goal of the RIC system is to dramatically improve the ability of biom...Research Interests Comparator (RIC) is our fourth electronic text mining project. The goal of the RIC system is to dramatically improve the ability of biomedical researchers to find information that is relevant to their areas of study, and to provide them
- Powerful Search Engine designed for Document Management, Competitive Intelligence, Press Analysis and Text Mining, Web Mining, Knowledge Discovery, Strateg...Powerful Search Engine designed for Document Management, Competitive Intelligence, Press Analysis and Text Mining, Web Mining, Knowledge Discovery, Strategic Watch...Has Report Writer, Web Spider, Publisher, more...
- Stanford InfoLab
- Hpricot vs Mechanize vs ScrAPI vs Watir vs ScRUBYt! vs…?
- Crowbar is a web scraping environment based on the use of a server-side headless mozilla-based browser. It is used as a research prototype to investigate h...Crowbar is a web scraping environment based on the use of a server-side headless mozilla-based browser. It is used as a research prototype to investigate how to enable the running of Piggy Bank javascript scrapers from the command line and thus automatin
- webcast on mining web-based content
- For serious researchers. Archive, webscrape, bookmark, search on tags...
- A web service (or web application) for researchers who wish to cache and webscrape documents and organize their materials.
- It is important to differentiate between text data mining and information access (or information retrieval, as it is more widely known)... the goal of data...It is important to differentiate between text data mining and information access (or information retrieval, as it is more widely known)... the goal of data mining is to discover or derive new information from data, finding patterns across datasets, and/o
- Feed43 engine converts free-form HTML or XML documents to valid RSS feeds by extracting snippets of text or HTML by means of applying search patterns, and ...Feed43 engine converts free-form HTML or XML documents to valid RSS feeds by extracting snippets of text or HTML by means of applying search patterns, and then joining these snippets together using output templates to form user-friendly content of feed's


groups