copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Joint Alignment of Segmentation and Labelling for Arabic Morphosyntactic Taggers

A. Alosaimy, and E. Atwell. International Journal of Computational Linguistics (IJCL), 9 (1): 1-12 (April 2018)

Abstract

We present and compare three methods of alignment between morphemes resulting from four different Arabic POS-taggers as well as one baseline method using only provided labels. We combined four Arabic POS-taggers: MADAMIRA (MA), Stanford Tagger (ST), AMIRA (AM), Farasa (FA); and as the target output used two Classical Arabic gold standards: Quranic Arabic Corpus (QAC) and SALMA Standard Arabic Linguistics Morphological Analysis (SAL). We justify why we opt to use label for aligning instead of word form. The problem is not trivial as it is tackling six different tokenisation and labelling standards. The supervised learning using a unigram model scored the best segment alignment accuracy, correctly aligning 97% of morpheme segments. We then evaluated the alignment methods extrinsically, in terms of their effect in improving accuracy of ensemble POS-taggers, merging different combinations of the four Arabic POS-taggers. Using the best approach to align input POS taggers, ensemble tagger has correctly segmented and tagged 88.09% of morphemes. We show how increasing the number of input taggers raise the accuracy, suggesting that input taggers make different errors.

Links and resources

BibTeX key: alosaimy2018joint
entry type: article
year: 2018
month: April
journal: International Journal of Computational Linguistics (IJCL)
number: 1
pages: 1-12
volume: 9
language: English
issn: 2180-1266
url: http://www.cscjournals.org/library/manuscriptinfo.php?mc=IJCL-84

Cite this publication

@article{alosaimy2018joint, abstract = {We present and compare three methods of alignment between morphemes resulting from four different Arabic POS-taggers as well as one baseline method using only provided labels. We combined four Arabic POS-taggers: MADAMIRA (MA), Stanford Tagger (ST), AMIRA (AM), Farasa (FA); and as the target output used two Classical Arabic gold standards: Quranic Arabic Corpus (QAC) and SALMA Standard Arabic Linguistics Morphological Analysis (SAL). We justify why we opt to use label for aligning instead of word form. The problem is not trivial as it is tackling six different tokenisation and labelling standards. The supervised learning using a unigram model scored the best segment alignment accuracy, correctly aligning 97% of morpheme segments. We then evaluated the alignment methods extrinsically, in terms of their effect in improving accuracy of ensemble POS-taggers, merging different combinations of the four Arabic POS-taggers. Using the best approach to align input POS taggers, ensemble tagger has correctly segmented and tagged 88.09% of morphemes. We show how increasing the number of input taggers raise the accuracy, suggesting that input taggers make different errors.}, added-at = {2018-12-12T05:26:25.000+0100}, author = {Alosaimy, Abdulrahman and Atwell, Eric}, biburl = {https://www.bibsonomy.org/bibtex/2cc409aa04477d3cb925ef7a9840fb3e8/cscjournals}, interhash = {9734640de222d07f3832e0447e46610f}, intrahash = {cc409aa04477d3cb925ef7a9840fb3e8}, issn = {2180-1266}, journal = {International Journal of Computational Linguistics (IJCL)}, keywords = {Alignment. Arabic, Morphological POS-Tagging, Segmentation, Tokenisation,}, language = {English}, month = {April}, number = 1, pages = {1-12}, timestamp = {2018-12-12T05:26:25.000+0100}, title = {Joint Alignment of Segmentation and Labelling for Arabic Morphosyntactic Taggers}, url = {http://www.cscjournals.org/library/manuscriptinfo.php?mc=IJCL-84}, volume = 9, year = 2018 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Joint Alignment of Segmentation and Labelling for Arabic Morphosyntactic Taggers

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Joint Alignment of Segmentation and Labelling for Arabic Morphosyntactic Taggers

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Joint Alignment of Segmentation and Labelling for Arabic Morphosyntactic Taggers

Comments and Reviews
(0)