Author of the publication

Randomized algorithms to update partial singular value decomposition on a hybrid CPU/GPU cluster.

, , , and . SC, page 59:1-59:12. ACM, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Utilizing dataflow-based execution for coupled cluster methods., , , , , , and . CLUSTER, page 296-297. IEEE Computer Society, (2014)Parallel reduction to hessenberg form with algorithm-based fault tolerance., , , and . SC, page 88:1-88:11. ACM, (2013)Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programs., , , and . ACM Trans. Math. Softw., 14 (1): 18-32 (1988)High-performance Cholesky factorization for GPU-only execution., , , and . GPGPU@PPoPP, page 42-52. ACM, (2017)Accelerating collaborative filtering using concepts from high performance computing., , , and . BigData, page 667-676. IEEE, (2015)A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs., and . SBAC-PAD, page 229-232. IEEE, (2018)Energy efficiency and performance frontiers for sparse computations on GPU supercomputers., , and . PMAM@PPoPP, page 1-10. ACM, (2015)Towards batched linear solvers on accelerated hardware platforms., , , , and . PPOPP, page 261-262. ACM, (2015)Panel: many-task computing meets exascales., , , , and . MTAGS@SC, page 3-4. ACM, (2011)Self-Adapting Numerical Software and Automatic Tuning of Heuristics., and . International Conference on Computational Science, volume 2660 of Lecture Notes in Computer Science, page 759-770. Springer, (2003)