English translation of selected chapters of the WikiWord thesis "Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia" by Daniel Kinzler. Translation by the author.
My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing
the data here is useful for testing classification / clustering, and the accuracy of indexing techniques. However the datasets are too small to make claims about the efficiency of indexing.
C. Hoede, and L. Zhang. Proceedings of the 9th International Conference on Conceptual Structures (ICCS 2001), volume 2120 of Lecture Notes in Computer Science, page 15-28. Springer, (2001)
J. Hopcroft, T. Lou, and J. Tang. Proceedings of the 20th ACM International Conference on Information and Knowledge Management, page 1137--1146. New York, NY, USA, ACM, (2011)
S. Wu, J. Hofman, W. Mason, and D. Watts. Proceedings of the 20th international conference on World wide web, page 705--714. New York, NY, USA, ACM, (2011)
G. Krempl, D. Bodnar, and A. Hrubos. Advances in Intelligent Data Analysis XIV - 14th Int. Symposium, IDA 2015, St. Etienne, France, volume 9385 of Lecture Notes in Computer Science, page XXII--XXIII. Springer, (2015)
D. Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W. Ma. Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, page 242--249. New York, NY, USA, ACM, (2004)
D. Shen, Z. Chen, Q. Yang, H. Zeng, B. Zhang, Y. Lu, and W. Ma. Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, page 242--249. New York, NY, USA, ACM, (2004)
C. Au Yeung, N. Gibbins, and N. Shadbolt. Proceedings of the Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR 2008), co-located with ECIR 2008, Glasgow, United Kingdom, 31 March, 2008, page 48--61. (2008)
L. Bing, R. Guo, W. Lam, Z. Niu, and H. Wang. Proceedings of the 37th International ACM SIGIR Conference on Research &\#38; Development in Information Retrieval, page 767--776. New York, NY, USA, ACM, (2014)
B. Choi, and Z. Yao. Foundations and Advances in Data Mining, volume 180 of Studies in Fuzziness and Soft Computing, Springer, Berlin / Heidelberg, (2005)
A. Sun, E. Lim, and W. Ng. Proceedings of the 4th international workshop on Web information and data management, page 96--99. New York, NY, USA, ACM, (2002)
L. Wu, M. Li, Z. Li, W. Ma, and N. Yu. MIR '07: Proceedings of the international workshop on Workshop on multimedia information retrieval, page 115--124. New York, NY, USA, ACM, (2007)