We propose a novel attention network for document annotation with user-generated tags. The network is designed according to the human reading and annotation behaviour. Usually, users try to digest the title and obtain a rough idea about the topic first, and then read the content of the document. Present research shows that the title metadata could largely affect the social annotation. To better utilise this information, we design a framework that separates the title from the content of a document and apply a title-guided attention mechanism over each sentence in the content. We also propose two semanticbased loss regularisers that enforce the output of the network to conform to label semantics, i.e. similarity and subsumption. We analyse each part of the proposed system with two real-world open datasets on publication and question annotation. The integrated approach, Joint Multi-label Attention Network (JMAN), significantly outperformed the Bidirectional Gated Recurrent Unit (Bi-GRU) by around 13%-26% and the Hierarchical Attention Network (HAN) by around 4%-12% on both datasets, with around 10%-30% reduction of training time.
D. Skoutas, and M. Alrifai. In Proc. of 20th ACM international conference on Information and knowledge management (CIKM '11), ACM, New York, NY, USA, 221-230., (2011)
J. Illig, A. Hotho, R. Jäschke, and G. Stumme. Knowledge Processing and Data Analysis, volume 6581 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg, 10.1007/978-3-642-22140-8_9.(2011)
J. Illig, A. Hotho, R. Jäschke, and G. Stumme. Knowledge Processing and Data Analysis, volume 6581 of Lecture Notes in Computer Science, page 136--149. Berlin/Heidelberg, Springer, (2011)
J. Illig, A. Hotho, R. Jäschke, and G. Stumme. Knowledge Processing and Data Analysis, volume 6581 of Lecture Notes in Computer Science, page 136--149. Berlin/Heidelberg, Springer, (2011)
C. Cattuto, D. Benz, A. Hotho, and G. Stumme. The Semantic Web -- ISWC 2008, Proc.Intl. Semantic Web Conference 2008, volume 5318 of LNAI, page 615--631. Heidelberg, Springer, (2008)
C. Cattuto, D. Benz, A. Hotho, and G. Stumme. The Semantic Web - ISWC 2008, volume 5318 of Lecture Notes in Computer Science, page 615--631. Springer Berlin / Heidelberg, (2008)
C. Preisach, L. Marinho, and L. Schmidt-Thieme. Advances in Knowledge Discovery and Data Mining, volume 6118 of Lecture Notes in Computer Science, Springer, (2010)
S. Lohmann, P. Heim, L. Tetzlaff, T. Ertl, and J. Ziegler. Proceedings of the 4th International Conference on Semantic and Digital Media Technologies (SAMT 2009), page 16--27. Berlin, Heidelberg, Springer, (2009)
C. Cattuto, D. Benz, A. Hotho, and G. Stumme. The Semantic Web -- ISWC 2008, Proc.Intl. Semantic Web Conference 2008, volume 5318 of LNAI, page 615--631. Heidelberg, Springer, (2008)
J. Illig, A. Hotho, R. Jäschke, and G. Stumme. Knowledge Processing and Data Analysis, volume 6581 of Lecture Notes in Computer Science, page 136--149. Berlin/Heidelberg, Springer, (2011)