Author of the publication

An Efficient Vectorization Approach to Nested Thread-level Parallelism for CUDA GPUs.

, and . PACT, page 488-489. IEEE Computer Society, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Group Project - with a Twist!, and . Proceedings of the Western Canadian Conference on Computing Education, (May 2005)Heterogeneous Multiconstraint Application Partitioner (HMAP)., , , and . TrustCom/ISPA/IUCC, page 999-1007. IEEE Computer Society, (2013)A Parallel Runtime Framework for Communication Intensive Stream Applications., , and . TrustCom/ISPA/IUCC, page 1179-1187. IEEE Computer Society, (2013)An Optimized Java Interpreter for Connected Devices and Embedded Systems., , , and . SAC, page 692-697. ACM, (2003)Real-Time Sensor Signal Capture from a Harsh Environment., , and . DS-RT, page 36-43. IEEE Computer Society, (2012)Towards Optimal Sorting Networks: The Third Level., and . CoRR, (2015)Virtual machine showdown: stack versus registers., , , and . VEE, page 153-163. ACM, (2005)Comparing integer data structures for 32- and 64-bit keys., and . ACM Journal of Experimental Algorithmics, (2010)Automatic Customization of Embedded Applications for Enhanced Performance and Reduced Power Using Optimizing Compiler Techniques., , and . Euro-Par, volume 3149 of Lecture Notes in Computer Science, page 318-327. Springer, (2004)Comparing Integer Data Structures for 32 and 64 Bit Keys., and . WEA, volume 5038 of Lecture Notes in Computer Science, page 28-42. Springer, (2008)