Author of the publication

Acceleration with long vector architectures: Implementation and evaluation of the FFT kernel on NEC SX-Aurora and RISC-V vector extension.

, , , and . Concurr. Comput. Pract. Exp., (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Quantifying the Potential Task-Based Dataflow Parallelism in MPI Applications., , , , and . Euro-Par (1), volume 6852 of Lecture Notes in Computer Science, page 39-51. Springer, (2011)Unrolling Loops Containing Task Parallelism., , , and . LCPC, volume 5898 of Lecture Notes in Computer Science, page 416-423. Springer, (2009)Barcelona OpenMP Tasks Suite: A Set of Benchmarks Targeting the Exploitation of Task Parallelism in OpenMP, , , , and . ICPP, page 124-131. IEEE Computer Society, (2009)An Extension to Improve OpenMP Tasking Control., , , , , , and . IWOMP, volume 6132 of Lecture Notes in Computer Science, page 56-69. Springer, (2010)Optimizing Overlapped Memory Accesses in User-directed Vectorization., , , , and . ICS, page 393-404. ACM, (2015)Achieving high memory performance from heterogeneous architectures with the SARC programming model., , , , and . MEDEA@PACT, page 15-21. ACM, (2009)Software Development Vehicles to Enable Extended and Early Co-design: A RISC-V and HPC Case of Study., , , , , , , , and . ISC Workshops, volume 13999 of Lecture Notes in Computer Science, page 526-537. Springer, (2023)DPU Offloading Programming with the OpenMP API., , , and . SC Workshops, page 884-891. ACM, (2023)Optimizing the Exploitation of Multicore Processors and GPUs with OpenMP and OpenCL., , , , , , , , and . LCPC, volume 6548 of Lecture Notes in Computer Science, page 215-229. Springer, (2010)Performance and energy effects on task-based parallelized applications - User-directed versus manual vectorization., , , , , , , and . J. Supercomput., 74 (6): 2627-2637 (2018)