Author of the publication

Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort.

, , , , , , and . SIGMOD Conference, page 351-362. ACM, (2010)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Budget sampling of parametric surface patches., and . SI3D, page 131-138. ACM, (2003)Can traditional programming bridge the ninja performance gap for parallel computing applications?, , , , , , , and . Commun. ACM, 58 (5): 77-86 (2015)Fast Sort on CPUs, GPUs and Intel MIC Architectures, , , , , , and . Technical Report, Intel Labs, (2010)High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach, , , , , , and . Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, page 69:1--69:11. New York, NY, USA, ACM, (2011)Compressing Large Boolean Matrices using Reordering Techniques., , , , and . VLDB, page 13-23. Morgan Kaufmann, (2004)Can traditional programming bridge the Ninja performance gap for parallel computing applications?, , , , , , , and . ISCA, page 440-451. IEEE Computer Society, (2012)CloudRAMSort: fast and efficient large-scale distributed RAM sort on shared-nothing cluster., , , , , and . SIGMOD Conference, page 841-850. ACM, (2012)ClearPath: highly parallel collision avoidance for multi-agent simulation., , , , , , and . Symposium on Computer Animation, page 177-187. ACM, (2009)Compression Tolerant Watermarking for Image Verification., , , , and . ICIP, page 430-433. IEEE, (2000)Physical simulation for animation and visual effects: parallelization and characterization for chip multiprocessors., , , , , , , , and . ISCA, page 220-231. ACM, (2007)