Although term extraction has been researched for more than 20 years, only a few studies focus on under-resourced languages. Moreover, bilingual term mapping from comparable corpora for these languages has attracted researchers only recently. This paper presents methods for term extraction, term tagging in documents, and bilingual term mapping from comparable corpora for four under-resourced languages: Croatian, Latvian, Lithuanian, and Romanian. Methods described in this paper are language independent as long as language specific parameter data is provided by the user and the user has access to a part of speech or a morpho-syntactic tagger.
In this project, we provide our implementations of CNN [Zeng et al., 2014] and PCNN [Zeng et al.,2015] and their extended version with sentence-level attention scheme [Lin et al., 2016] .
NYT10 is originally released by the paper "Sebastian Riedel, Limin Yao, and Andrew McCallum. Modeling relations and their mentions without labeled text."
P. Pantel, and M. Pennacchiotti. ACL '06: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, page 113--120. Morristown, NJ, USA, Association for Computational Linguistics, (2006)
H. Chieu, and H. Ng. Eighteenth national conference on Artificial intelligence, page 786--791. Menlo Park, CA, USA, American Association for Artificial Intelligence, (2002)
F. Kokkoras, N. Bassiliades, and I. Vlahavas. Proceedings of the 15th International Conference on Conceptual Structures (ICCS 2007), volume 4604 of Lecture Notes in Artificial Intelligence, page 476-479. Berlin, Heidelberg, Springer-Verlag, (July 2007)
G. Paliouras. Proceedings of the 13th International Conference on Conceptual Structures (ICCS 2005), volume 3596 of Lecture Notes in Computer Science, page 119-135. Springer, (2005)
G. Angelova. Proceedings of the 13th International Conference on Conceptual Structures (ICCS 2005), volume 3596 of Lecture Notes in Computer Science, page 367-380. Springer, (2005)
M. Kayed, and K. Shaalan. IEEE Transactions on Knowledge and Data Engineering, 18 (10):
1411--1428(2006)Member-Chia-Hui Chang and Member-Moheb Ramzy Girgis.
Y. Jin, Y. Matsuo, and M. Ishizuka. Proceedings of the European Semantic Web Conference, ESWC2007, volume 4519 of Lecture Notes in Computer Science, Springer-Verlag, (July 2007)
M. Kayed, and K. Shaalan. IEEE Transactions on Knowledge and Data Engineering, 18 (10):
1411--1428(2006)Member-Chia-Hui Chang and Member-Moheb Ramzy Girgis.
H. Han, C. Giles, E. Manavoglu, H. Zha, Z. Zhang, and E. Fox. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, page 37--48. Washington, DC, USA, IEEE Computer Society, (2003)
A. Takasu. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, page 49--60. Washington, DC, USA, IEEE Computer Society, (2003)
S. Huffman. Connectionist, Statistical, And Symbol Approaches to Learning for
Natural Language Processing, volume 1040, page 246-260. Springer, (1996)
A. Takasu. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, page 49--60. Washington, DC, USA, IEEE Computer Society, (2003)
G. Gottlob, C. Koch, R. Baumgartner, M. Herzog, and S. Flesca. Proceedings of the Twenty-third ACM SIGACT-SIGMOD-SIGART Symposium
on Principles of Database Systems, June 14-16, 2004, Paris, France, page 1-12. ACM, (2004)