CUDA lets you work with familiar programming concepts while developing software that can run on a GP This is the first of a series of articles to introduce you to the power of CUDA -- through working code -- and to the thought process to help you map applications onto multi-threaded hardware (such as GPUs) to get big performance increases. Of course, not all problems can be mapped efficiently onto multi-threaded hardware, so part of my thought process will be to distinguish what will and what won't work, plus provide a common-sense idea of what might work "well-enough". "CUDA programming" and "GPGPU programming" are not the same (although CUDA runs on GPUs). CUDA permits working with familiar programming concepts while developing software that can run on a GPU. It also avoids the performance overhead of graphics layer APIs by compiling your software directly to the hardware (GPU assembly language, for instance), thereby providing great performance.
OpenVIDIA : GPU accelerated Computer Vision Library The OpenVIDIA project implements computer vision algorithms on computer graphics hardware, using OpenGL and Cg. The project provides useful example programs which run real time computer vision algorit
R. Okuta, Y. Unno, D. Nishino, S. Hido, and C. Loomis. Proceedings of Workshop on Machine Learning Systems (LearningSys) in The Thirty-first Annual Conference on Neural Information Processing Systems (NIPS), (2017)
D. Chang, A. Desoky, M. Ouyang, and E. Rouchka. 2009 10th ACIS International Conference on Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing, page 501--506. IEEE, (2009)
G. Capannini, F. Silvestri, and R. Baraglia. Information Processing & Management, 48 (5):
903--917(2012)Large-Scale and Distributed Systems for Information Retrieval.
C. Hong, D. Chen, W. Chen, W. Zheng, and H. Lin. Proceedings of the 19th International Conference on Parallel
Architectures and Compilation Techniques, page 217--226. New York, NY, USA, ACM, (2010)
A. Cheik Ahamed, and F. Magoulès. Distributed Computing and Applications to Business, Engineering Science (DCABES), 2013 12th International Symposium on, page 16-20. (September 2013)
F. Magoulès, A. Cheik Ahamed, A. Desmaison, J. Lechenet, F. Mayer, H. Ben Salem, and T. Zhu. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, page 304-311. (August 2014)
A. Cheik Ahamed, and F. Magoulès. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, page 121-128. (August 2014)
A. Cheik Ahamed, and F. Magoulès. Distributed Computing and Applications to Business, Engineering and Science (DCABES), 2014 13th International Symposium on, page 19-23. (November 2014)
A. Cheik Ahamed, A. Desmaison, and F. Magoulès. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, page 129-136. (August 2014)
L. Lai, C. Lai, A. Cheik Ahamed, and F. Magoules. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, page 137-144. (August 2014)