Tools and techniques for performance - Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems).

, , , , , and . SC, p. 113. ACM Press, (2006)

Other publications by persons with the same name

Parallel QR Factorization of Block-Tridiagonal Matrices., , and . SIAM J. Sci. Comput., 42 (6): C313-C334 (2020)

On the Complexity of the Block Low-Rank Multifrontal Factorization., , , and . SIAM J. Sci. Comput., (2017)

Block Low-Rank Matrices with Shared Bases: Potential and Limitations of the BLR2 Format., , and . SIAM J. Matrix Anal. Appl., 42 (2): 990-1010 (2021)

2LEV-D2P4: a package of high-performance preconditioners for scientific and engineering applications., , , and . Appl. Algebra Eng. Commun. Comput., 18 (3): 223-239 (2007)

Performance Optimization and Modeling of Blocked Sparse Kernels., , , and . Int. J. High Perform. Comput. Appl., 21 (4): 467-484 (2007)

FAST-EVP: An Engine Simulation Tool., , , , , and . HPCC, volume 3726 of Lecture Notes in Computer Science, pp. 969-978. Springer, (2005)

Pre-exascale Architectures: OpenPOWER Performance and Usability Assessment for French Scientific Community., , , , , , , , , and 20 other author(s). ISC Workshops, volume 10524 of Lecture Notes in Computer Science, pp. 309-324. Springer, (2017)

A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures, , , and . CoRR, (2007)

Exploiting Mixed Precision Floating Point Hardware in Scientific Computations., , , , , , and . High Performance Computing Workshop, volume 16 of Advances in Parallel Computing, pp. 19-36. IOS Press, (2006)

Parallel tiled QR factorization for multicore architectures., , , and . Concurr. Comput. Pract. Exp., 20 (13): 1573-1590 (2008)