Author of the publication

Fine Tuning Matrix Multiplications on Multicore.

, , and . HiPC, volume 5374 of Lecture Notes in Computer Science, page 30-41. Springer, (2008)

Other publications of authors with the same name

Quantifying performance bottleneck cost through differential analysis., , , , , and . ICS, page 263-272. ACM, (2013)

Simsys: a performance simulation framework., , , , , and . RAPIDO, page 1:1-1:8. ACM, (2013)

The Cedar System and an Initial Performance Study., , , , , , , , , and 12 other author(s). ISCA, page 213-223. ACM, (1993)

Software prefetch on core micro-architecture applied to irregular codes., , , and . HPCS, page 264-272. IEEE, (2011)

To copy or not to copy: a compile-time technique for assessing when data copying should be used to eliminate cache conflicts., , and . SC, page 410-419. ACM, (1993)

Loop Optimization using Hierarchical Compilation and Kernel Decomposition., , , , and . CGO, page 170-184. IEEE Computer Society, (2007)

OCEANS - Optimising Compilers for Embedded Applications., , , , , , , , , and 9 other author(s). Euro-Par, volume 1685 of Lecture Notes in Computer Science, page 1171-1175. Springer, (1999)

Improving Load/Store Queues Usage in Scientific Computing., , and . ICPP, page 38-45. IEEE Computer Society, (2004)

XOR-Schemes: A Flexible Data Organization in Parallel Memories., , and . ICPP, page 276-283. IEEE Computer Society Press, (1985)