Although term extraction has been researched for more than 20 years, only a few studies focus on under-resourced languages. Moreover, bilingual term mapping from comparable corpora for these languages has attracted researchers only recently. This paper presents methods for term extraction, term tagging in documents, and bilingual term mapping from comparable corpora for four under-resourced languages: Croatian, Latvian, Lithuanian, and Romanian. Methods described in this paper are language independent as long as language specific parameter data is provided by the user and the user has access to a part of speech or a morpho-syntactic tagger.
M. Schwab, R. Jäschke, und F. Fischer. Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Seite 110--115. Association for Computational Linguistics, (2023)
M. Schwab, R. Jäschke, und F. Fischer. Proceedings of the 5th International Conference on Natural Language and Speech Processing, Seite 282--287. Association for Computational Linguistics, (2022)
F. Arnold, und R. Jäschke. Proceedings of the Workshop Understanding LIterature references in academic full TExt at JCDL 2022, Volume 3220 von ULITE-ws '22, Seite 7--15. CEUR Workshop Proceedings, (2022)
M. Javidi, und E. Roshan. Speech Emotion Recognition by Using Combinations of Support Vector Machine (SVM), and C5.0, 1, Seite 21 - 33. Applied Mathematics and Sciences: An International Journal (MathSJ), (August 2014)
G. Muzny, M. Fang, A. Chang, und D. Jurafsky. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, Seite 460--470. Valencia, Spain, Association for Computational Linguistics, (April 2017)
C. Scheible, R. Klinger, und S. Padó. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Seite 1736--1745. Berlin, Germany, Association for Computational Linguistics, (August 2016)