Article,

Effective Sparse Matrix Representation for the GPU Architectures

B. Neelima1, and P. Raghavendra2.
International Journal of Computer Science, Engineering and Applications (IJCSEA), 02 (02): 151-165 (April 2012)
DOI: 10.5121/ijcsea.2012.2213

Full text

Abstract

General purpose computation on graphics processing unit (GPU) is prominent in the high performance computing era of this time. Porting or accelerating the data parallel applications onto GPU gives the default performance improvement because of the increased computational units. Better performances can be seen if application specific fine tuning is done with respect to the architecture under consideration. One such very widely used computation intensive kernel is sparse matrix vector multiplication (SPMV) in sparse matrix based applications. Most of the existing data format representations of sparse matrix are developed with respect to the central processing unit (CPU) or multi cores. This paper gives a new format for sparse matrix representation with respect to graphics processor architecture that can give 2x to 5x performance improvement compared to CSR (compressed row format), 2x to 54x performance improvement with respect to COO (coordinate format) and 3x to 10 x improvement compared to CSR vector format for the class of application that fit for the proposed new format. It also gives 10% to 133% improvements in memory transfer (of only access information of sparse matrix) between CPU and GPU. This paper gives the details of the new format and its requirement with complete experimentation details and results of comparison.

BibTeX key: noauthororeditor
entry type: article
year: 2012
month: April
journal: International Journal of Computer Science, Engineering and Applications (IJCSEA)
number: 02
pages: 151-165
volume: 02
issn: 2230 - 9616
DOI: 10.5121/ijcsea.2012.2213
Document: http://airccse.org/journal/ijcsea/papers/2212ijcsea13.pdf

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{noauthororeditor, abstract = {General purpose computation on graphics processing unit (GPU) is prominent in the high performance computing era of this time. Porting or accelerating the data parallel applications onto GPU gives the default performance improvement because of the increased computational units. Better performances can be seen if application specific fine tuning is done with respect to the architecture under consideration. One such very widely used computation intensive kernel is sparse matrix vector multiplication (SPMV) in sparse matrix based applications. Most of the existing data format representations of sparse matrix are developed with respect to the central processing unit (CPU) or multi cores. This paper gives a new format for sparse matrix representation with respect to graphics processor architecture that can give 2x to 5x performance improvement compared to CSR (compressed row format), 2x to 54x performance improvement with respect to COO (coordinate format) and 3x to 10 x improvement compared to CSR vector format for the class of application that fit for the proposed new format. It also gives 10% to 133% improvements in memory transfer (of only access information of sparse matrix) between CPU and GPU. This paper gives the details of the new format and its requirement with complete experimentation details and results of comparison.}, added-at = {2018-07-26T15:01:19.000+0200}, author = {Neelima1, B. and Raghavendra2, Prakash S.}, biburl = {https://www.bibsonomy.org/bibtex/2b071a9d6dd54aa397bbdcd625359bf06/ijcsea}, doi = {10.5121/ijcsea.2012.2213}, interhash = {9dc8f83c9c00318c33806d484d701188}, intrahash = {b071a9d6dd54aa397bbdcd625359bf06}, issn = {2230 - 9616}, journal = {International Journal of Computer Science, Engineering and Applications (IJCSEA)}, keywords = {algorithms networks}, month = {April}, number = 02, pages = {151-165}, timestamp = {2018-07-26T15:01:19.000+0200}, title = {Effective Sparse Matrix Representation for the GPU Architectures}, url = {http://airccse.org/journal/ijcsea/papers/2212ijcsea13.pdf}, volume = 02, year = 2012 }

BibSonomy

Effective Sparse Matrix Representation for the GPU Architectures

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on