Author of the publication

Tuning basic Linear Algebra Routines for Hybrid CPU+GPU Platforms.

, , , and . ICCS, volume 29 of Procedia Computer Science, page 30-39. Elsevier, (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Deploying deep learning approaches to left ventricular non-compaction measurement., , , and . J. Supercomput., 77 (9): 10138-10151 (2021)A self-optimized software tool for quantifying the degree of left ventricle hyper-trabeculation., , , and . J. Supercomput., 75 (3): 1625-1640 (2019)Expanding the deep-learning model to diagnosis LVNC: Limitations and trade-offs., , , , and . CoRR, (2023)Optimizing a 3D-FWT Code in a Heterogeneous Cluster of Multicore CPUs and Manycore GPUs., , and . SBAC-PAD, page 97-104. IEEE Computer Society, (2013)Reducing 3D Wavelet Transform Execution Time through the Streaming SIMD Extensions., , and . PDP, page 49-56. IEEE Computer Society, (2003)Code Detection for Hardware Acceleration Using Large Language Models., , and . IEEE Access, (2024)An efficient implementation of a 3D wavelet transform based encoder on hyper-threading technology., , , , and . Parallel Comput., 33 (1): 54-72 (2007)HDNN: a cross-platform MLIR dialect for deep neural networks., , and . J. Supercomput., 78 (11): 13814-13830 (2022)An Autotuning Engine for the 3D Fast Wavelet Transform on Clusters with Hybrid CPU + GPU Platforms., , and . Int. J. Parallel Program., 43 (6): 1160-1191 (2015)A lossy 3D wavelet transform for high-quality compression of medical video., , and . J. Syst. Softw., 82 (3): 526-534 (2009)