Engineer friends often ask me: Graph Deep Learning sounds great, but are there any big commercial success stories? Is it being deployed in practical applications? Besides the obvious ones–recommendation systems at Pinterest, Alibaba and Twitter–a slightly nuanced success story is the Transformer architecture, which has taken the NLP industry by storm. Through this post, I want to establish links between Graph Neural Networks (GNNs) and Transformers. I’ll talk about the intuitions behind model architectures in the NLP and GNN communities, make connections using equations and figures, and discuss how we could work together to drive progress.
X. Zhang, and Y. LeCun. (2015)cite arxiv:1502.01710Comment: This technical report is superseded by a paper entitled "Character-level Convolutional Networks for Text Classification", arXiv:1509.01626. It has considerably more experimental results and a rewritten introduction.
C. Cummins, P. Petoumenos, Z. Wang, and H. Leather. Proceedings of the 2017 International Symposium on Code Generation and Optimization, page 86–99. IEEE Press, (2017)
N. Kimura, M. Kono, and J. Rekimoto. Proceedings of the 2019 CHI Conference on Human Factors in
Computing Systems, Paper 146, page 1--11. New York, NY, USA, Association for Computing Machinery, (May 2019)