English translation of selected chapters of the WikiWord thesis "Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia" by Daniel Kinzler. Translation by the author.
My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing
the data here is useful for testing classification / clustering, and the accuracy of indexing techniques. However the datasets are too small to make claims about the efficiency of indexing.
C. Au Yeung, N. Gibbins, and N. Shadbolt. Proceedings of the Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR 2008), co-located with ECIR 2008, Glasgow, United Kingdom, 31 March, 2008, page 48--61. (2008)
L. Bing, R. Guo, W. Lam, Z. Niu, and H. Wang. Proceedings of the 37th International ACM SIGIR Conference on Research &\#38; Development in Information Retrieval, page 767--776. New York, NY, USA, ACM, (2014)
B. Choi, and Z. Yao. Foundations and Advances in Data Mining, volume 180 of Studies in Fuzziness and Soft Computing, Springer, Berlin / Heidelberg, (2005)
A. Sun, E. Lim, and W. Ng. Proceedings of the 4th international workshop on Web information and data management, page 96--99. New York, NY, USA, ACM, (2002)