Short introduction to Vector Space Model (VSM) In information retrieval or text mining, the term frequency - inverse document frequency also called tf-idf, is
This is the first of a three-part series called TFIDF In Libraries, where “relevancy ranking” will be introduced. In this part, term frequency/inverse document frequency (TFIDF) — a common mathematical method of weighing texts for automatic classification and sorting search results — will be described.
B. Fortuna, M. Grobelnik, and D. Mladenic. Proceedings of the 15th international conference on World Wide Web, page 949--950. New York, NY, USA, ACM, (2006)
R. Jin, and A. Hauptmann. Lecture Notes in AI, Second International Conference on Intelligent
Text Processing and Computational Linguistics, Mexico City, Mexico, February 18 to 24, 2001., Springer, (2001)