From post

Performance Analysis of the Kahan-Enhanced Scalar Product on Current Multicore Processors.

, , , , , и . PPAM (1), том 9573 из Lecture Notes in Computer Science, стр. 63-73. Springer, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Performance Engineering for a Tall & Skinny Matrix Multiplication Kernel on GPUs., , , и . CoRR, (2019)Comparison of different propagation steps for lattice Boltzmann methods., , , и . Comput. Math. Appl., 65 (6): 924-935 (2013)Making Applications Faster by Asynchronous Execution: Slowing Down Processes or Relaxing MPI Collectives., , , и . CoRR, (2023)Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX., , , , , , и . PMBS@SC, стр. 1-7. IEEE, (2020)Core-Level Performance Engineering with the Open-Source Architecture Code Analyzer (OSACA) and the Compiler Explorer., и . ICPE (Companion), стр. 127-131. ACM, (2023)Opening the Black Box: Performance Estimation during Code Generation for GPUs., , , , и . SBAC-PAD, стр. 22-32. IEEE, (2021)SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study., , и . SC Workshops, стр. 1245-1254. ACM, (2023)Optimization of an Electromagnetics Code with Multicore Wavefront Diamond Blocking and Multi-dimensional Intra-Tile Parallelization., , , , , и . IPDPS, стр. 142-151. IEEE Computer Society, (2016)The world's fastest CPU and SMP node: Some performance results from the NEC SX-9., , и . IPDPS, стр. 1-8. IEEE, (2009)Propagation and Decay of Injected One-Off Delays on Clusters: A Case Study., , и . CLUSTER, стр. 1-10. IEEE, (2019)