- Course in computational linguistics and cognitive science at UiB: notes, exercises, etc.
- «Jan Daciuk's page is the font of all knowledge FSA/FST»
- toread. All of it.
- «Traditionally, unification grammars are hand-coded. This is extremely time consuming, expensive and very difficult to scale. [...] we have developed a new method for automatically extracting wide-coverage probabilistic unification (LFG) grammars from treebank resources. To achieve this, we first automatically annotate the treebank (such as Penn-II) with feature-structure information (LFG f-structures, approximating to basic predicate-argument structure). From the f-structure annotated treebank, we then automatically extract wide-coverage, probabilistic LFG approximations to parse new text»
- monolingual, parallel and annotated corpora. There are fourteen monolingual corpora, including both written and (for some languages) spoken data for fourteen South Asian languages: Assamese, Bengali, Gujarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Oriya, Punjabi, Sinhala, Tamil, Telugu and Urdu. The EMILLE monolingual corpora contain approximately 92,799,000 words (including 2,627,000 words of transcribed spoken data for Bengali, Gujarati, Hindi, Punjabi and Urdu). The parallel corpus consists of 200,000 words of text in English and its accompanying translations in Hindi, Bengali, Punjabi, Gujarati and Urdu. The annotated component includes the Urdu monolingual and parallel corpora annotated for parts-of-speech, together with twenty written Hindi corpus files annotated to show the nature of demonstrative use. The corpus is marked up using CES-compliant SGML, and encoded using Unicode.
- Parallel corpora, freely available
- MultiTree is a searchable database of hypotheses on language relationships. Users can compare language trees and access bibliographical information on them; see a graphical representation of every scholarly hypothesis on language relationships; view information on every language; share comments on hypotheses and add new hypotheses (as a registered user); and access an interactive map of the language or family of their choice through LLMap.
- NLTK for Prolog
- by: Mikaela Keller
- Very Google Suggest-like
- Lecture videos, slides etc
- by: Michel Simard, Pierre Plamondon
- Evaluating multiple word/sentence alignment and WSD systems
- Documentation for LFG resources at UiB, by Paul Meurer
- Treebanks and Linguistic Theories, conference
- Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, page 35--42. Alicante, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, (2009)
- Universitetet i Bergen, Bergen, Norway, (2010)
- Proceedings of the Workshop on Human Judgements in Computational Linguistics, page 51--57. Manchester, Association for Computational Linguistics, (2008)
- International Journal of Communications Law and Policy (2009)
- Computational Linguistics 29(1):19--51 (2003)
- Proceedings of LFG09, page 317--337. Trinity College, Cambridge, CSLI Publications, (2009)
- NAACL '03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, page 48--54. Morristown, NJ, Association for Computational Linguistics, (2003)
- COLING-GEE '02 Proceedings of the 2002 Workshop on Grammar Engineering and Evaluation, 15, page 1--7. Morristown, NJ, Association for Computational Linguistics, (2002)
- Proceedings of the Eighth International Workshop on Treebanks and Linguistic Theories, page 71--82. Milano, EDUCatt, (2009)
- Proceedings of the 31st Annual Conference of the Association for Computational Linguistics, page 9--16. Columbus, Ohio, Association for Computational Linguistics, (1993)
- Computational Linguistics 19(2):263--311 (1993)
- COLING-GEE '02 Proceedings of the 2002 Workshop on Grammar Engineering and Evaluation, 15, page 1--7. Morristown, NJ, Association for Computational Linguistics, (2002)
- Proceedings of Treebanks and Linguistic Theories TLT '07, Bergen, Norway, NEALT, (2007)
- Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning, page 33--39. Borovets, Bulgaria, Association for Computational Linguistics, (2009)
- Proceedings of the Workshop on Linguistic Coreference, page 74--78. Granada, LREC, (1998)
- (2009)
- (2010), accepted.
- Proceedings of the 22nd International Conference on Computational Linguistics, 1, page 1105--1112. Manchester, Association for Computational Linguistics, (2008)
- Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 1, page 8--15. Sapporo, Japan, Association for Computational Linguistics, (2003)
- Proceedings of Treebanks and Linguistic Theories TLT '07, Bergen, Norway, NEALT, (2007)

