Modern graphics processing units (GPUs) contain hundreds of arithmetic units and can be harnessed to provide tremendous acceleration for many numerically intensive scientific applications. The key to effective utilization of GPUs for scientific computing
Harte Zahlen zur Rechenleistung künftiger Nvidia-GPUs gab es jedoch nicht, die Einheit, in der Huangs Roadmap auf der Vertikalen skaliert ist "GFlops pro Watt". Wie der Nvidia-Mitbegründer betonte, ist die Rechenleistung nicht das Problem, sondern die "Power Wall". Schon mit den ersten Fermi-Grafikkarten kratzte Nvidia an der Grenze von 300 Watt.
provides a software development platform that allows developers to take advantage of a new generation of high performance processors. These new processors, including GPUs, the IBM Cell, and other multi-core processors
Marvin is a deep learning framework designed first and foremost to be hackable. It is naively simple for fast prototyping, uses only basic C/C++, and only calls CUDA and cuDNN as dependencies.
C. Abal-Kassim, und M. Frédéric. Distributed Computing and Applications to Business, Engineering and Science (DCABES), 2014 13th International Symposium on, Seite 46-50. (ноября 2014)
J. Auerbach, D. Bacon, P. Cheng, und R. Rabbah. OOPSLA '10: Proceedings of the ACM international conference on Object oriented programming systems languages and applications, Seite 89--108. New York, NY, USA, ACM, (2010)
M. Bauer, H. Cook, und B. Khailany. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, Seite 12:1--12:11. New York, NY, USA, ACM, (2011)
E. Berger, S. Stern, und J. Pizzorno. 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23), Boston, MA, USENIX Association, (июля 2023)
G. Capannini, F. Silvestri, und R. Baraglia. Information Processing & Management, 48 (5):
903--917(2012)Large-Scale and Distributed Systems for Information Retrieval.
D. Chang, A. Desoky, M. Ouyang, und E. Rouchka. 2009 10th ACIS International Conference on Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing, Seite 501--506. IEEE, (2009)
A. Cheik Ahamed, A. Desmaison, und F. Magoulès. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, Seite 129-136. (августа 2014)
A. Cheik Ahamed, und F. Magoulès. Distributed Computing and Applications to Business, Engineering Science (DCABES), 2013 12th International Symposium on, стр. 16-20. (сентября 2013)
A. Cheik Ahamed, и F. Magoulès. Distributed Computing and Applications to Business, Engineering and Science (DCABES), 2014 13th International Symposium on, стр. 19-23. (ноября 2014)
A. Cheik Ahamed, и F. Magoulès. High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS), 2014 IEEE Intl Conf on, стр. 121-128. (августа 2014)
J. Choi, A. Singh, и R. Vuduc. Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, стр. 115--126. New York, NY, USA, ACM, (2010)
A. Drozd, N. Maruyama, и S. Matsuoka. Proceedings of the 2011 companion on High Performance Computing Networking, Storage and Analysis Companion, стр. 21--22. New York, NY, USA, ACM, (2011)
N. Govindaraju, J. Gray, R. Kumar, и D. Manocha. Proceedings of the 2006 ACM SIGMOD international conference on Management of data, стр. 325--336. New York, NY, USA, ACM, (2006)