Marvin is a deep learning framework designed first and foremost to be hackable. It is naively simple for fast prototyping, uses only basic C/C++, and has CUDA and cuDNN as its only dependencies.
This first post in a series on CUDA C and C++ covers the basic concepts of parallel programming on the CUDA platform with C/C++.
int i = blockDim.x * blockIdx.x + threadIdx.x;
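The index expression above gives each thread a unique global position in a one-dimensional grid: the block's offset (blockDim.x * blockIdx.x) plus the thread's position within its block (threadIdx.x). The following is a minimal sketch of how it might be used, not code taken from the linked post. The kernel name vecAdd, the host helper vecAddOnGpu, and the block size of 256 are assumptions chosen for illustration.

```cuda
#include <cassert>
#include <vector>
#include <cuda_runtime.h>

// Each thread computes one output element, selected by its global index i.
__global__ void vecAdd(const float* a, const float* b, float* c, int n) {
    int i = blockDim.x * blockIdx.x + threadIdx.x;
    if (i < n) {  // threads past the end of the array do nothing
        c[i] = a[i] + b[i];
    }
}

// Hypothetical host helper (not from the source): adds two equal-length
// vectors on the GPU. Returns an empty vector if any CUDA call fails.
std::vector<float> vecAddOnGpu(const std::vector<float>& a,
                               const std::vector<float>& b) {
    assert(a.size() == b.size());
    const int n = static_cast<int>(a.size());
    if (n == 0) return {};
    const size_t bytes = n * sizeof(float);

    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    std::vector<float> c(n);
    bool ok = cudaMalloc(&dA, bytes) == cudaSuccess &&
              cudaMalloc(&dB, bytes) == cudaSuccess &&
              cudaMalloc(&dC, bytes) == cudaSuccess &&
              cudaMemcpy(dA, a.data(), bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
              cudaMemcpy(dB, b.data(), bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        const int threadsPerBlock = 256;  // assumed block size
        // Round up so that every element is covered by some thread.
        const int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;
        vecAdd<<<blocks, threadsPerBlock>>>(dA, dB, dC, n);
        // The device-to-host copy waits for the kernel to finish.
        ok = cudaGetLastError() == cudaSuccess &&
             cudaMemcpy(c.data(), dC, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(dA);  // cudaFree on a null pointer is a no-op
    cudaFree(dB);
    cudaFree(dC);
    if (!ok) c.clear();
    return c;
}
```

Because the grid size is rounded up, the last block can contain threads whose index is at or beyond n, which is why the kernel guards the write with i < n.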
A. Dallmann, P. Beck, and J. von Gudenberg. Parallel Processing and Applied Mathematics, volume 8385 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2014)
N. Vasilache, M. Baskaran, B. Meister, and R. Lethin. Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, pp. 42-53. New York, NY, USA, ACM, (2013)
A. Cheik Ahamed, and F. Magoulès. Distributed Computing and Applications to Business, Engineering and Science (DCABES), 2013 12th International Symposium on, pp. 16-20. (September 2013)
A. Cheik Ahamed, and F. Magoulès. High Performance Computing and Communication 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on, pp. 836-842. (June 2012)
A. Cheik Ahamed, and F. Magoulès. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, pp. 121-128. (August 2014)
A. Cheik Ahamed, and F. Magoulès. Distributed Computing and Applications to Business, Engineering and Science (DCABES), 2014 13th International Symposium on, pp. 19-23. (November 2014)
C. Abal-Kassim, and M. Frédéric. Distributed Computing and Applications to Business, Engineering and Science (DCABES), 2014 13th International Symposium on, pp. 46-50. (November 2014)
A. Cheik Ahamed, and F. Magoulès. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, pp. 54-61. (August 2014)