Author of the publication

Vectorization past dependent branches through speculation.

, , and . PACT, page 353-362. IEEE Computer Society, (2013)

Other publications of authors with the same name

Minimizing startup costs for performance-critical threading., and . IPDPS, page 1-8. IEEE, (2009)
Reducing Floating Point Error in Dot Product Using the Superblock Family of Algorithms., , and . SIAM J. Sci. Comput., 31 (2): 1156-1174 (2008)
Scaling LAPACK panel operations using parallel cache assignment., and . PPoPP, page 223-232. ACM, (2010)
Achieving Scalable Parallelization for the Hessenberg Factorization., and . CLUSTER, page 65-73. IEEE Computer Society, (2011)
ATLAS Version 3.9: Overview and Status.. Software Automatic Tuning, From Concepts to State-of-the-Art Results, Springer, (2010)
Scaling LAPACK panel operations using parallel cache assignment., , and . ACM Trans. Math. Softw., 39 (4): 22:1-22:30 (2013)
Empirically tuning LAPACK's blocking factor for increased performance.. IMCSIT, page 303-310. IEEE, (2008)
ATLAS (Automatically Tuned Linear Algebra Software).. Encyclopedia of Parallel Computing, Springer, (2011)
Vectorization past dependent branches through speculation., , and . PACT, page 353-362. IEEE Computer Society, (2013)
An updated set of basic linear algebra subprograms (BLAS), , , , , , , , , and 1 other author(s). ACM Transactions on Mathematical Software, 28 (2): 135-151 (2002)