English translation of selected chapters of the WikiWord thesis "Automatischer Aufbau eines multilingualen Thesaurus durch Extraktion semantischer und lexikalischer Relationen aus der Wikipedia" by Daniel Kinzler. Translation by the author.
My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing
the data here is useful for testing classification / clustering, and the accuracy of indexing techniques. However the datasets are too small to make claims about the efficiency of indexing.
J. Esparza, and F. Reiter. 31st International Conference on Concurrency Theory (CONCUR 2020), volume 171 of Leibniz International Proceedings in Informatics (LIPIcs), page 10:1--10:16. Dagstuhl, Germany, Schloss Dagstuhl--Leibniz-Zentrum für Informatik, (2020)Preprint: <a href="https://arxiv.org/abs/2007.03291">Link</a><br>#conference.
D. Willems, and L. Vuurpijl. Proceedings of the Ninth international conference on document analysis and recognition, page 869-873. Curitiba, Brazil, (2007)
M. Sahami, S. Dumais, D. Heckerman, and E. Horvitz. Learning for Text Categorization: Papers from the 1998 Workshop, Madison, Wisconsin, AAAI Technical Report WS-98-05, (1998)
R. Neßelrath, and J. Alexandersson. Proceedings of the 6th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems. Twenty-First International Joint Conference On Artificial Intelligence (IJCAI -09), in Conjunction with 6th IJCAI Workshop on Knowledge and Reasoning in Practical Dialogue Systems (KRPD-09), July 12, Pasadena, California, United States, page 46-51. IJCAI 2009, (July 2009)