Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method.

Y. Tsuji, K. Osawa, Y. Ueno, A. Naruse, R. Yokota, and S. Matsuoka. ICPP Workshops, page 21:1-21:8. ACM, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Chotaro Naruse

Akira Hayashima

Akira Hori

Akira Satake

Akira Hayashima

Other publications of authors with the same name

Scalable and Practical Natural Gradient for Large-Scale Deep Learning.K. Osawa, Y. Tsuji, Y. Ueno, A. Naruse, C. Foo, and R. Yokota. CoRR, (2020)Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method.Y. Tsuji, K. Osawa, Y. Ueno, A. Naruse, R. Yokota, and S. Matsuoka. ICPP Workshops, page 21:1-21:8. ACM, (2019)Parallel Top-K Algorithms on GPU: A Comprehensive Study and New Methods.J. Zhang, A. Naruse, X. Li, and Y. Wang. SC, page 76:1-76:13. ACM, (2023)Speeding Up Kernel Scheduler by Reducing Cache Misses.S. Yamamura, A. Hirai, M. Sato, M. Yamamoto, A. Naruse, and K. Kumon. USENIX Annual Technical Conference, FREENIX Track, page 275-285. USENIX, (2002)Interference-aware Incoming Message Detection for MPI Threaded Progression.M. Miwa, K. Nakashima, and A. Naruse. CCGRID, page 184-185. IEEE Computer Society, (2013)GPU Implementation of a Sophisticated Implicit Low-Order Finite Element Solver with FP21-32-64 Computation Using OpenACC.T. Yamaguchi, K. Fujita, T. Ichimura, A. Naruse, L. Maddegedara, and M. Hori. WACCPD@SC, volume 12017 of Lecture Notes in Computer Science, page 3-24. Springer, (2019)Large-Scale Distributed Second-Order Optimization Using Kronecker-Factored Approximate Curvature for Deep Convolutional Neural Networks.K. Osawa, Y. Tsuji, Y. Ueno, A. Naruse, R. Yokota, and S. Matsuoka. CVPR, page 12359-12367. Computer Vision Foundation / IEEE, (2019)A Fast Scalable Implicit Solver with Concentrated Computation for Nonlinear Time-Evolution Problems on Low-Order Unstructured Finite Elements.T. Ichimura, K. Fujita, M. Horikoshi, L. Meadows, K. Nakajima, T. Yamaguchi, K. Koyama, H. Inoue, A. Naruse, K. Katsushima and 2 other author(s). IPDPS, page 620-629. IEEE Computer Society, (2018)Massively parallel algorithm and implementation of RI-MP2 energy calculation for peta-scale many-core supercomputers.M. Katouda, A. Naruse, Y. Hirano, and T. Nakajima. J. Comput. Chem., 37 (30): 2623-2633 (2016)CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs.H. Ootomo, A. Naruse, C. Nolet, R. Wang, T. Feher, and Y. Wang. CoRR, (2023)

BibSonomy

Disambiguation of "Naruse, Akira"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method.

Please choose a person to relate this publication to

Chotaro Naruse

Akira Hayashima

Akira Hori

Akira Satake

Akira Hayashima

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Naruse, Akira"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method.

Please choose a person to relate this publication to

Chotaro Naruse

Akira Hayashima

Akira Hori

Akira Satake

Akira Hayashima

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Performance Optimizations and Analysis of Distributed Deep Learning with Approximated Second-Order Optimization Method.