I am investigating computational models for linguistic structures and processes, with application to language technologies and to the documentation of endangered languages. My current focus is on efficient query for databases of hierarchically annotated data. After completing a PhD on computational phonology at the University of Edinburgh in 1990, I worked on a series of European research projects and conducted linguistic fieldwork in Cameroon with SIL. In 1998 I moved to the University of Pennsylvania, becoming Associate Director of the LDC, and working on models and tools for linguistic annotation. In 2002 I returned home to Australia and established the Melbourne University Language Technology Group. In 2007 I was awarded the Kelvin Medal for excellence in teaching.
Key Activities: Coordinating first year Informatics; developing the Natural Language Toolkit; writing a textbook on NLP; leading the Language Technology Group; working on an NSF project on Querying Linguistic Databases; and editing Cambridge Studies in Natural Language Processing and the ACL Anthology.
Key Publications: Natural Language Processing in Python; Computational phonology: A constraint-based approach (Cambridge); A formal framework for linguistic annotation (Speech Communication); Seven dimensions of portability for language documentation and description (Language); Designing and evaluating an XPath dialect for linguistic queries (ICDE).
Online Demo of the TreeTagger. A tool for annotating text with part-of-speech and lemma information which has been developed at the Institute for Computational Linguistics of the University of Stuttgart.
Shalmaneser is a supervised learning toolbox for shallow semantic parsing, i.e. the automatic assignment of semantic classes and roles to text. The system was developed for Frame Semantics; thus we use Frame Semantics terminology and call the classes frames and the roles frame elements. However, the architecture is reasonably general, and with a certain amount of adaption, Shalmaneser should be usable for other paradigms (e.g., PropBank roles) as well. Shalmaneser caters both for end users, and for researchers.
Alle Programme und Resourcen auf der Liste sind frei, d.h. kostenlos (für Forschungszwecke) verfügbar, auf deutschsprachige Texte anwendbar und sofort startklar, d.h. sie müssen nicht erst mit Hilfe von z.B. annotierten Korpora trainiert werden. Die Liste ist natürlich unvollständig (Stand 22.5.2007).
The Talking Heads experiment raises questions in three areas of research: How do words get their meanings? Is artificial intelligence possible? How should machines interact with humans?
D. Jurafsky, and J. Martin. Prentice Hall series in artificial intelligence Prentice Hall, Pearson Education International, Englewood Cliffs, NJ, 2. ed., Pearson International Edition edition, (2009)
M. Pasca. CIKM '07: Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, page 683--690. New York, NY, USA, ACM, (2007)
J. Liu, and J. Wang. Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering, page 129-134. Beijing, China, (2007)
D. Leite, and L. Rino. Fourth Workshop in Information and Human Language Technology (TIL'06 Poster Section) in the Proceedings of International Joint Conference, 10th Ibero-American Artificial Intelligence Conference, 18th Brazilian Artificial Intelligence Symposium, 9th Brazilian Neural Networks Symposium, IBERAMIA-SBIA-SBRN, Ribeirão Preto, Brazil, ICMC-USP, (October 2006)
J. Wermter, and U. Hahn. 44th Annual Meeting of the Association for Computational Linguistics, page 785--792. Sydney, Australia, Association for Computational Linguistics, (July 2006)
T. Pardo, L. Antiqueira, M. Nunes, O. Oliveira Jr., and L. Costa. Proceedings of the International Conference on Communications, Circuits and Systems (ICCCAS'06) - Special Session on Complex Networks, page 2678-2682. Gui Lin, China, UESTC Press, (June 2006)
V. Nastase, and S. Szpakowicz. Proceedings of TextGraphs: the Second Workshop on Graph Based Methods for Natural Language Processing, page 29-32. New York City, Association for Computational Linguistics, (June 2006)
R. Mihalcea. Proceedings of the 7th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2006), volume 3878 of LNCS, page 249-262. Mexico City, Mexico, (February 2006)