Y. Chai, S. Jin, and X. Hou. Highway Transformer: Self-Gating Enhanced Self-Attentive Networks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 6887–6900, Online, July 2020. Association for Computational Linguistics.
Abstract
Self-attention mechanisms have made striking state-of-the-art (SOTA) progress in various sequence learning tasks, building on multi-headed dot-product attention that attends to all global contexts at different locations. Through a pseudo information highway, we introduce a gated component, the self-dependency unit (SDU), which incorporates LSTM-styled gating units to replenish the internal semantic importance within the multi-dimensional latent space of individual representations. The subsidiary content-based SDU gates allow modulated latent embeddings to flow through skip connections, yielding a clear margin in convergence speed under gradient descent. We unveil the role of the gating mechanism in aiding context-based Transformer modules, hypothesizing that SDU gates, especially on shallow layers, push the model faster toward suboptimal points during optimization.
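For illustration, a minimal sketch of an LSTM-styled self-dependency gate is given below, based only on the abstract's description (a content-based sigmoid gate over a per-token candidate, with a skip connection). The class and parameter names (SDUGate, gate_proj, value_proj) are hypothetical and do not come from the authors' implementation.

import torch
import torch.nn as nn

class SDUGate(nn.Module):
    """Hypothetical content-based gate that modulates each token's own representation."""
    def __init__(self, d_model: int):
        super().__init__()
        self.gate_proj = nn.Linear(d_model, d_model)   # sigmoid gate, LSTM-style
        self.value_proj = nn.Linear(d_model, d_model)  # candidate values

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # The gate depends only on the token itself (self-dependency),
        # not on other positions as in self-attention.
        g = torch.sigmoid(self.gate_proj(x))
        h = torch.tanh(self.value_proj(x))
        # Skip connection: pass the input through alongside the gated values.
        return x + g * h

Per the abstract, such a unit would sit alongside the self-attention sublayers of a Transformer block, with the claimed benefit being faster convergence, particularly in shallow layers.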
@inproceedings{chai-etal-2020-highway,
address = {Online},
author = {Chai, Yekun and Jin, Shuo and Hou, Xinwen},
booktitle = {Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics},
month = jul,
pages = {6887--6900},
publisher = {Association for Computational Linguistics},
title = {Highway Transformer: Self-Gating Enhanced Self-Attentive Networks},
url = {https://www.aclweb.org/anthology/2020.acl-main.616},
year = 2020
}