BibSonomy ::
concept ::
user :: unhammer ::
The blue social bookmark and publication sharing system.
< >
- monolingual, parallel and annotated corpora. There are fourteen monolingual corpora, including both written and (for some languages) spoken data for fo...monolingual, parallel and annotated corpora. There are fourteen monolingual corpora, including both written and (for some languages) spoken data for fourteen South Asian languages: Assamese, Bengali, Gujarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Oriya, Punjabi, Sinhala, Tamil, Telegu and Urdu. The EMILLE monolingual corpora contain approximately 92,799,000 words (including 2,627,000 words of transcribed spoken data for Bengali, Gujarati, Hindi, Punjabi and Urdu). The parallel corpus consists of 200,000 words of text in English and its accompanying translations in Hindi, Bengali, Punjabi, Gujarati and Urdu. The annotated component includes the Urdu monolingual and parallel corpora annotated for parts-of-speech, together with twenty written Hindi corpus files annotated to show the nature of demonstrative use. The corpus is marked up using CES-compliant SGML, and encoded using Unicode.
- Very Google Suggest-like
- Documentation for LFG resources at UiB, by Paul Meurer
- Language Log post by Arnold Zwicky
- I: Ø. Andersen, K. Fløttum og T. Kinn (red.): Menneske, språk og fellesskap. Festskrift til Kirsti Koch Christensen på 60-årsdagen 1. desember 2000, Novus ...I: Ø. Andersen, K. Fløttum og T. Kinn (red.): Menneske, språk og fellesskap. Festskrift til Kirsti Koch Christensen på 60-årsdagen 1. desember 2000, Novus forlag, s. 25-45.
< >
- Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation, page 35--42. Alicante, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, (2009)
- COLING-GEE '02 Proceedings of the 2002 Workshop on Grammar Engineering and Evaluation, 15, page 1--7. Morristown, NJ, Association for Computational Linguistics, Association for Computational Linguistics, (2002)
- Proceedings of the Corpus Linguistics 2001 Conference, page 466--475. Lancaster, UK, UCREL, (2001)
- Complex Predicates in Nonderivational Syntax, volume 30 of Syntax and Semantics, chapter 1, Academic Press, New York, (1998)
- Proceedings of the 13th Annual Conference of the European Association of Machine Translation, EAMT09, (2009)
- Parallel Distributed Processing, 2, MIT Press, (1986)
- Papers from the 16th Scandinavian Conference of Linguistics, Turku/Åbo, Finland, (1996)
- Creating and Using English Language Corpora, 13, Amsterdam, Editions Rodopi, (1994)
- Proceedings of DECALOG SEMDIAL07, page 157--158. Trento, Italy, (2007)
- EACL 1995, page 149--156. Belfield, Dublin, Ireland, University College Dublin, (1995)
- (2008)Submitted to the Research Council of Norway. .
- Menneske, språk og fellesskap. Festskrift til Kirsti Koch Christensen på 60-årsdagen 1. desember 2000, Novus Forlag, Oslo, (2000)
- Menneske, språk og fellesskap. Festskrift til Kirsti Koch Christensen på 60-årsdagen 1. desember 2000, Novus Forlag, Oslo, (2000)
- ACL '02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, page 392--399. Morristown, NJ, USA, Association for Computational Linguistics, (2001)
- The Prague Bulletin of Mathematical Linguistics (2009)
- (2006)
- Englisch in Zeit und Raum-English in Time and Space: Forschungsbericht für Klaus Faiss. Trier: Wissenschaftlicher Verlag Trier (2006)
- A Festschrift for Kjell Johan Sæbø -- in partial fulfilment of the requirements for the celebration of his 50th birthday, Unipub, Oslo, (2006)
- Nordic Journal of Linguistics 30(02):185--208 (2007)
- (2008)


