- Short introduction to Vector Space Model (VSM) In information retrieval or text mining, the term frequency - inverse document frequency also called tf-idf,...Short introduction to Vector Space Model (VSM) In information retrieval or text mining, the term frequency - inverse document frequency also called tf-idf, is
- The ways of organizing information are finite. It can only be organized by location, alphabet, time, category, or hierarchy. These modes are applicable to ...The ways of organizing information are finite. It can only be organized by location, alphabet, time, category, or hierarchy. These modes are applicable to almost any endeavor—from your personal file cabinets to multinational corporations. They are the framework upon which annual reports, books, conversations, exhibitions, directories, conventions, and even warehouses are arranged.
- Ideas, issues, concepts, subjects - visualized!
- This is the project page for SecondString, an open-source Java-based package of approximate string-matching techniques. This code was developed by research...This is the project page for SecondString, an open-source Java-based package of approximate string-matching techniques. This code was developed by researchers at Carnegie Mellon University from the Center for Automated Learning and Discovery, the Department of Statistics, and the Center for Computer and Communications Security. SecondString is intended primarily for researchers in information integration and other scientists. It does or will include a range of string-matching methods from a variety of communities, including statistics, artificial intelligence, information retrieval, and databases. It also includes tools for systematically evaluating performance on test data. It is not designed for use on very large data sets.
- Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, page 81--88. New York, NY, USA, ACM, (2008)
- HT '08: Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia, page 157--166. New York, NY, USA, ACM, (2008)
- Universität Düsseldorf, PhD thesis, (2009)
- Information - Wissenschaft und Praxis 59(2):77--90 (2008)
- Graphics Press, Second edition, (2001)
- ACM Transactions on Information Systems 20(4):422--446 (October 2002)
- SIGIR '00: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, page 41--48. New York, NY, USA, ACM, (2000)
- Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, (1999)
- HT '08: Proceedings of the nineteenth ACM conference on Hypertext and hypermedia, page 81--88. New York, NY, USA, ACM, (2008)
- Library Review 55(5):291-300 (2006)
- The Semantic Web: Research and Applications, volume 4011 of Lecture Notes in Computer Science, page 411-426. Heidelberg, Springer, (June 2006)
- JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, page 49--60. Washington, DC, USA, IEEE Computer Society, (2003)
- HLT-NAACL, page 329-336. (2004)
- (2002)http://mallet.cs.umass.edu .


user