tag :: cuda | BibSonomy

bookmarks (63)

Introducing Triton: Open-source GPU programming for neural networks
https://openai.com/research/triton
a year ago by @bshanks
Tags: cuda, neuralnet

Accelerated Computing with CUDA Python Workshop | NVIDIA
This workshop teaches you the fundamental tools and techniques for running GPU-accelerated Python applications using CUDA.
2 years ago by @topel
Tags: courses, cuda

Deterministic python
https://twitter.com/kastnerkyle/status/1473361479143460872?t=-q6YbOcOZ2AXJ_EgW0vkJg&s=19
2 years ago by @becker
Tags: cuda, deterministic, learning, machine, ml, numpy, python, random, seed, torch

GPU CUDA Technical Resources
https://www.gpuhackathons.org/technical-resources
3 years ago by @topel
Tags: cuda

Multi-GPU CUDA stress test
http://wili.cc/blog/gpu-burn.html
5 years ago by @nosebrain
Tags: burn, cuda, gpu, stress, test

need guide to build with CUDA 10.1 · Issue #26150 · tensorflow/tensorflow
https://github.com/tensorflow/tensorflow/issues/26150
5 years ago by @becker
Tags: install, ubuntu, nvidia, gpu, tensorflow, tf, import, error, libcublas, cuda, 10, cuda10, linux, python, py

GPU support | TensorFlow
https://www.tensorflow.org/install/gpu#software_requirements
5 years ago by @becker
Tags: cuda, nvidia, gpu, driver, cuda10, 10, import, error, tensorflow, tf, install, ubuntu, linux, python, py

Build from source | TensorFlow
https://www.tensorflow.org/install/source#tested_build_configurations
5 years ago by @becker
Tags: cuda, cuda10, 10, python, ubuntu, py, tensorflow, import, error, libcublas, nvidia, gpu, install, linux

GitHub - wilicc/gpu-burn: Multi-GPU CUDA stress test
Multi-GPU CUDA stress test. Contribute to wilicc/gpu-burn development by creating an account on GitHub.
5 years ago by @nosebrain
Tags: burn, cuda, gpu

YOLO: Real-Time Object Detection
You only look once (YOLO) is a state-of-the-art, real-time object detection system.
6 years ago by @hotho
Tags: cuda, darknet, deep, detection, learning, object, opencv, yolo

Articles | QuantStart
Article lists: C++ Language; Numerical Methods in C++; GPU/CUDA Programming in C++; Python Implementation
6 years ago by @achakraborty
Tags: article, blog, c++, collection, cpp, cuda, finance, gpu, numerical, programming, python, reference

CUDA-Z
http://cuda-z.sourceforge.net/
6 years ago by @achakraborty
Tags: cuda, nvidia, software

CUDA-Z
http://cuda-z.sourceforge.net
6 years ago by @hotho
Tags: Nvidia, cuda, mac, ml, test

Building Cross-Platform CUDA Applications with CMake | NVIDIA Developer Blog
Easily build cross-platform CUDA C++ applications using the powerful build management features of CMake. CUDA support is now a core feature of CMake.
6 years ago by @achakraborty
Tags: article, blog, build, c++, cmake, cpp, cuda, nvidia

Deep Learning Frameworks | NVIDIA Developer
https://developer.nvidia.com/deep-learning-frameworks
6 years ago by @achakraborty
Tags: article, blog, cuda, deep-learning, nvidia, reference

An Even Easier Introduction to CUDA | NVIDIA Developer Blog
A quick and easy introduction to CUDA programming for GPUs. This post dives into CUDA C++ with a simple, step-by-step parallel programming example.
6 years ago by @achakraborty
Tags: blog, c++, cpp, cuda, gpu, nvidia, parallel, programming, tutorial

Marvin: Deep Learning in N Dimensions
Marvin is a deep learning framework designed first and foremost to be hackable. It is naively simple for fast prototyping, uses only basic C/C++, and only calls CUDA and cuDNN as dependencies.
6 years ago by @achakraborty
Tags: c++, cpp, cuda, deep-learning, gpu, library, neural-networks

An Easy Introduction to CUDA C and C++
This first post in a series on CUDA C and C++ covers the basic concepts of parallel programming on the CUDA platform with C/C++. int i = blockDim.x * blockIdx.x + threadIdx.x
7 years ago by @buddybent
Tags: cuda

Exploring K-Means in Python, C++ and CUDA – Peter Goldsborough
Implementations of K-Means in three different environments
7 years ago by @achakraborty
Tags: article, blog, c++, cpp, cuda, k-means, machine-learning, python

Making Theano Faster with CuDNN and CNMeM on Windows 10 – Ankivil
http://ankivil.com/making-theano-faster-with-cudnn-and-cnmem-on-windows-10/
7 years ago by @hprop
Tags: cnmem, cuda, cudnn, theano

publications (68)

GLoP: Enabling Massively Parallel Incident Response Through GPU Log Processing
X. Bellekens, C. Tachtatzis, R. Atkinson, C. Renfrew, and T. Kirkham. Proceedings of the 7th International Conference on Security of Information and Networks, page 295:295--295:301. New York, NY, USA, ACM, (2014)
8 years ago by @xavierbe
Tags: cloud, cuda, dpi, glop, ids, log

Data remanence and digital forensic investigation for CUDA Graphics Processing Units
X. Bellekens, G. Paul, J. Irvine, C. Tachtatzis, R. Atkinson, T. Kirkham, and C. Renfrew. 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM), page 1345-1350. (May 2015)
8 years ago by @xavierbe
Tags: cuda, forensic, gpu, memory, remanence

CUDA by Example: An Introduction to General-Purpose GPU Programming
J. Sanders, and E. Kandrot. Addison-Wesley, Upper Saddle River, NJ, (2010)
8 years ago by @ytyoun
Tags: cuda, gpu, parallel, programming, textbook

A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware
H. Shi, B. Schmidt, W. Liu, and W. Müller-Wittig. Journal of Computational Biology, 17 (4): 603--615 (2010). PMID: 20426693.
9 years ago by @ytyoun
Tags: algorithm, bioinformatics, cuda, genome, parallel, programming, sequencing

Bones: an automatic skeleton-based C-to-CUDA compiler for GPUs
C. Nugteren, and H. Corporaal. ACM Transactions on Architecture and Code Optimization, (2014)
9 years ago by @christophv
Tags: Algorithmic_species, CUDA, Code_generation, Expose, GPU, Skeleton-based

Auto-tuning a high-level language targeted to GPU codes
S. Grauer-Gray, L. Xu, R. Searles, S. Ayalasomayajula, and J. Cavazos. Innovative Parallel Computing (InPar), 2012, page 1--10. (May 2012)
9 years ago by @christophv
Tags: Abstracts, Auto-tuning, Belief_Propagation, Benchmark_testing, CUDA, DSL, Expose, GPU, GPU_codes, Graphics_processing_unit, HMPP, Nickel, OpenCL, OpenCL_code, Optimization, PolyBench_suite, Programming, Tiles, autotuning, belief_propagation, convolution_kernels, graphics_processing_unit, graphics_processing_units, high-level_directive-based_language, high_level_languages, hybrid_multicore_parallel_programming, loop_permutation, loop_tiling, loop_unrolling, multiprocessing_systems, optimization_configuration, parallel_architectures, parallel_programming, program_compilers, source-to-source_compiler, stereo_image_processing, stereo_vision

Finding Enclosures for Linear Systems Using Interval Matrix Multiplication in CUDA
A. Dallmann, P. Beck, and J. von Gudenberg. Parallel Processing and Applied Mathematics, volume 8385 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2014)
9 years ago by @dallmann
Tags: cuda, enclosures, linear_systems, myown

Polyhedral Parallel Code Generation for CUDA
S. Verdoolaege, J. Carlos Juega, A. Cohen, J. Ignacio Gómez, C. Tenllado, and F. Catthoor. ACM Trans. Archit. Code Optim., 9 (4): 54:1--54:23 (January 2013)
9 years ago by @christophv
Tags: All, C-to-CUDA, CUDA, Expose, GPU, PPCG, Par4All, Polyhedral_model, To_Read, code_generation, compilers, loop_transformations

Memory Reuse Optimizations in the R-Stream Compiler
N. Vasilache, M. Baskaran, B. Meister, and R. Lethin. Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, page 42--53. New York, NY, USA, ACM, (2013)
9 years ago by @christophv
Tags: CUDA, GPGPU, Memory_reuse, Polyhedral_model, R-Stream, To_Read, automatic_translation, compiler_optimziation, parallelization, polyhedral_model

Optimizing Data Warehousing Applications for GPUs Using Kernel Fusion/Fission
H. Wu, G. Diamos, J. Wang, S. Cadambi, S. Yalamanchili, and S. Chakradhar. Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), 2012 IEEE 26th International, page 2433--2442. (May 2012)
9 years ago by @christophv
Tags: Algorithm, Bandwidth, CPU_memory, CUDA, Efficient_query_execution, Expose, Fission, Fusion, GPU, GPU_memory, GPU_registers, Graphics_processing_unit, Kernel, Memory_management, NVIDIA_Fermi_GPU, Optimization, TPC-H, TPC-H_benchmark_suite, Throughput, Warehousing, compiler, compiler_optimizations, data_movement_reduction, data_throughput_improvements, data_transfers, data_warehouses, data_warehousing, data_warehousing_applications, general_purpose_GPU, graphics_processing_unit, graphics_processing_units, kernel_fission, kernel_fusion, loop_fission_optimization, loop_fusion_optimization, memory_reference_spatial_locality, memory_reference_temporal_locality, optimising_compilers, optimization, parallel_architectures, query_processing, redundant_operation_elimination, relational_algebra, relational_algebra_operators, relational_computation_processing, relational_query_processing, scientific_computing_community, segment_computations, storage_management
