Author of the publication

Efficient Temporal Blocking for Stencil Computations by Multicore-Aware Wavefront Parallelization.

, , , , and . COMPSAC (1), page 579-586. IEEE Computer Society, (2009)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

An analysis of energy-optimized lattice-Boltzmann CFD simulations from the chip to the highly parallel level, , , and . CoRR, (2013)Comparison of different propagation steps for lattice Boltzmann methods., , , and . Comput. Math. Appl., 65 (6): 924-935 (2013)EXASTEEL: Towards a Virtual Laboratory for the Multiscale Simulation of Dual-Phase Steel Using High-Performance Computing., , , , , , , , , and 4 other author(s). Software for Exascale Computing, volume 136 of Lecture Notes in Computational Science and Engineering, Springer, (2020)Asynchronous Checkpointing by Dedicated Checkpoint Threads., , , and . EuroMPI, volume 7490 of Lecture Notes in Computer Science, page 289-290. Springer, (2012)Modeling and analyzing performance for highly optimized propagation steps of the lattice Boltzmann method on sparse lattices., , , and . CoRR, (2014)Short Note on Costs of Floating Point Operations on current x86-64 Architectures: Denormals, Overflow, Underflow, and Division by Zero., , , and . CoRR, (2015)Hybrid Parallel Multigrid Methods for Geodynamical Simulations., , , , , , , , , and 4 other author(s). Software for Exascale Computing, volume 113 of Lecture Notes in Computational Science and Engineering, Springer, (2016)Hardware-effiziente, hochparallele Implementierungen von Lattice-Boltzmann-Verfahren für komplexe Geometrien.. University of Erlangen-Nuremberg, Germany, (2016)Energy efficiency of nonlinear domain decomposition methods., , , , and . Int. J. High Perform. Comput. Appl., (2021)A Proof of Concept for Optimizing Task Parallelism by Locality Queues, and . CoRR, (2009)