Article,

Marky: A tool supporting annotation consistency in multi-user and iterative document annotation projects

, , , and .
Computer Methods and Programs in Biomedicine, 118 (2): 242 - 251 (2015)
DOI: http://dx.doi.org/10.1016/j.cmpb.2014.11.005

Abstract

AbstractBackground and objectives Document annotation is a key task in the development of Text Mining methods and applications. High quality annotated corpora are invaluable, but their preparation requires a considerable amount of resources and time. Although the existing annotation tools offer good user interaction interfaces to domain experts, project management and quality control abilities are still limited. Therefore, the current work introduces Marky, a new Web-based document annotation tool equipped to manage multi-user and iterative projects, and to evaluate annotation quality throughout the project life cycle. Methods At the core, Marky is a Web application based on the open source CakePHP framework. User interface relies on \HTML5\ and \CSS3\ technologies. Rangy library assists in browser-independent implementation of common \DOM\ range and selection tasks, and Ajax and \JQuery\ technologies are used to enhance user–system interaction. Results Marky grants solid management of inter- and intra-annotator work. Most notably, its annotation tracking system supports systematic and on-demand agreement analysis and annotation amendment. Each annotator may work over documents as usual, but all the annotations made are saved by the tracking system and may be further compared. So, the project administrator is able to evaluate annotation consistency among annotators and across rounds of annotation, while annotators are able to reject or amend subsets of annotations made in previous rounds. As a side effect, the tracking system minimises resource and time consumption. Conclusions Marky is a novel environment for managing multi-user and iterative document annotation projects. Compared to other tools, Marky offers a similar visually intuitive annotation experience while providing unique means to minimise annotation effort and enforce annotation quality, and therefore corpus consistency. Marky is freely available for non-commercial use at http://sing.ei.uvigo.es/marky.

Tags

Users

  • @isaric1

Comments and Reviews