Abstract
In this paper, we propose a novel multi-task learning architecture, which
incorporates recent advances in attention mechanisms. Our approach, the
Multi-Task Attention Network (MTAN), consists of a single shared network
containing a global feature pool, together with task-specific soft-attention
modules, which are trainable in an end-to-end manner. These attention modules
learn task-specific features from the global pool, whilst simultaneously
allowing features to be shared across different tasks. The
architecture can be built upon any feed-forward neural network, is simple to
implement, and is parameter efficient. Experiments on the CityScapes dataset
show that our method outperforms several baselines in both single-task and
multi-task learning, and is also more robust to the various weighting schemes
in the multi-task loss function. We further explore the effectiveness of our
method through experiments over a range of task complexities, and show how our
method scales well with task complexity compared to baselines.
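To make the described mechanism concrete, the following is a minimal PyTorch sketch of a shared network with per-task soft-attention modules, in the spirit of the abstract. It is an illustrative reconstruction, not the authors' released implementation: the names (SoftAttentionModule, MTANSketch, channels, num_tasks) are hypothetical, and the tiny convolutional backbone merely stands in for "any feed-forward neural network".

```python
import torch
import torch.nn as nn


class SoftAttentionModule(nn.Module):
    """Hypothetical task-specific attention module: learns a soft mask
    over the shared features and applies it element-wise, selecting
    task-relevant features from the global pool."""

    def __init__(self, channels):
        super().__init__()
        self.mask = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=1),
            nn.BatchNorm2d(channels),
            nn.Sigmoid(),  # soft attention weights in [0, 1]
        )

    def forward(self, shared_features):
        # Element-wise gating of the shared (global) features.
        return self.mask(shared_features) * shared_features


class MTANSketch(nn.Module):
    """Single shared backbone (the 'global feature pool') plus one
    attention module per task; the whole model trains end-to-end."""

    def __init__(self, channels=64, num_tasks=2):
        super().__init__()
        # Stand-in for an arbitrary feed-forward backbone.
        self.shared = nn.Sequential(
            nn.Conv2d(3, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
        )
        self.attention = nn.ModuleList(
            SoftAttentionModule(channels) for _ in range(num_tasks)
        )

    def forward(self, x):
        pool = self.shared(x)  # global feature pool, shared by all tasks
        return [attn(pool) for attn in self.attention]  # per-task features


# Usage: two task-specific feature maps from one shared forward pass.
model = MTANSketch()
task_features = model(torch.randn(1, 3, 32, 32))
```

The attention modules add only lightweight mask layers per task on top of the single shared backbone, which is one way to read the abstract's claim of parameter efficiency.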