Author of the publication

State-of-the-art eigensolvers for electronic structure calculations of large scale nano-systems.

, , , , , and . J. Comput. Phys., 227 (15): 7113-7124 (2008)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling., , and . ICCS (1), volume 5544 of Lecture Notes in Computer Science, page 195-204. Springer, (2009)GPU-Aware Non-contiguous Data Movement In Open MPI., , , , and . HPDC, page 231-242. ACM, (2016)Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems., , , and . SPAA, page 280-288. ACM, (2007)CPU-GPU hybrid bidiagonal reduction with soft error resilience., , , and . ScalA@SC, page 2:1-2:5. ACM, (2013)Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers., , , and . SC, page 47:1-47:11. IEEE / ACM, (2018)Optimized Batched Linear Algebra for Modern Architectures., , , , and . Euro-Par, volume 10417 of Lecture Notes in Computer Science, page 511-522. Springer, (2017)Selected Results from the ParkBench Benchmark., , and . Euro-Par, Vol. II, volume 1124 of Lecture Notes in Computer Science, page 251-254. Springer, (1996)ADAPT: an event-based adaptive collective communication framework., , , , , and . HPDC, page 118-130. ACM, (2018)Self adaptivity in Grid computing., and . Concurr. Pract. Exp., 17 (2-4): 235-257 (2005)The design and implementation of the parallel out-of-core ScaLAPACK LU, QR, and Cholesky factorization routines., and . Concurr. Pract. Exp., 12 (15): 1481-1493 (2000)