<rdf:RDF xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"><channel rdf:about="http://www.bibsonomy.org/user/unhammer/Kashmiri"><title>BibSonomy bookmarks for /user/unhammer/Kashmiri</title><link>http://www.bibsonomy.org/rss/user/unhammer/Kashmiri</link><description>BibSonomy RSS Feed for /user/unhammer/Kashmiri</description><items><rdf:Seq><rdf:li rdf:resource="http://www.lancs.ac.uk/fass/projects/corpus/emille/"/></rdf:Seq></items></channel><item rdf:about="http://www.lancs.ac.uk/fass/projects/corpus/emille/"><title>The EMILLE Corpus</title><description>The EMILLE Corpus consists of monolingual, parallel and annotated corpora. There are fourteen monolingual corpora, including both written and (for some
  languages) spoken data for fourteen South Asian languages: Assamese, 
  Bengali, Gujarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Oriya, Punjabi, 
  Sinhala, Tamil, Telugu and Urdu. The EMILLE monolingual corpora contain
  approximately 92,799,000 words (including 2,627,000 words of transcribed spoken data for Bengali, Gujarati,
  Hindi, Punjabi and Urdu). The parallel corpus consists of 200,000 words of text in English and its accompanying
  translations in Hindi, Bengali, Punjabi, Gujarati and Urdu. The annotated component 
  includes the Urdu monolingual and parallel corpora annotated for parts-of-speech, 
  together with twenty written Hindi corpus files annotated to show the nature 
  of demonstrative use. The corpus is marked up using CES-compliant SGML, and 
  encoded using Unicode.</description><link>http://www.lancs.ac.uk/fass/projects/corpus/emille/</link><dc:creator>unhammer</dc:creator><dc:date>2009-04-27T14:57:07+02:00</dc:date><dc:subject>Assamese Bengali Gujarati Hindi Kannada Kashmiri Malayalam Marathi Oriya Punjabi Sinhala Tamil Telegu Urdu corpus parallel </dc:subject><content:encoded>The EMILLE Corpus consists of monolingual, parallel and annotated corpora. There are fourteen monolingual corpora, including both written and (for some
  languages) spoken data for fourteen South Asian languages: Assamese,
  Bengali, Gujarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Oriya, Punjabi,
  Sinhala, Tamil, Telugu and Urdu. The EMILLE monolingual corpora contain
  approximately 92,799,000 words (including 2,627,000 words of transcribed spoken data for Bengali, Gujarati,
  Hindi, Punjabi and Urdu). The parallel corpus consists of 200,000 words of text in English and its accompanying
  translations in Hindi, Bengali, Punjabi, Gujarati and Urdu. The annotated component
  includes the Urdu monolingual and parallel corpora annotated for parts-of-speech,
  together with twenty written Hindi corpus files annotated to show the nature
  of demonstrative use. The corpus is marked up using CES-compliant SGML, and
  encoded using Unicode.</content:encoded><taxo:topics><rdf:Bag><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Assamese"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Bengali"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Gujarati"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Hindi"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Kannada"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Kashmiri"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Malayalam"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Marathi"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Oriya"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Punjabi"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Sinhala"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Tamil"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Telegu"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/Urdu"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/corpus"/><rdf:li rdf:resource="http://www.bibsonomy.org/tag/parallel"/></rdf:Bag></taxo:topics></item></rdf:RDF>
