Lecture (Other) FZJ-2022-05803

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
GPU Programming with CUDA

 ;  ;  ;  ;

2022

Lecture at PRACE Training Course at JSC (online), 25 Apr 2022 - 29 Apr 20222022-04-252022-04-29

Please use a persistent id in citations:

Abstract: GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GPUs offers high application performance by offloading compute-intensive portions of the code to an NVIDIA GPU. The course covers basic aspects of GPU architectures and programming. Focus is on the usage of the parallel programming language CUDA C/C++ which allows maximum control of NVIDIA GPU hardware. Examples of increasing complexity are used to demonstrate optimization and tuning of scientific applications. Topics covered will include: Introduction to GPU/Parallel computing; Programming model CUDA; GPU libraries like CuBLAS and CuFFT; Tools for debugging and profiling; Performance optimizations; Advanced GPU programming model; CUDA Fortran in a nutshell.This course is a PRACE training course.


Note: Online course within the PRACE and FZJ training program.

Contributing Institute(s):
  1. Jülich Supercomputing Center (JSC)
Research Program(s):
  1. 5112 - Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups (POF4-511) (POF4-511)
  2. 5111 - Domain-Specific Simulation & Data Life Cycle Labs (SDLs) and Research Groups (POF4-511) (POF4-511)
  3. PRACE-6IP - PRACE 6th Implementation Phase Project (823767) (823767)

Appears in the scientific report 2022
Database coverage:
OpenAccess
Click to display QR Code for this record

The record appears in these collections:
Document types > Presentations > Lectures
Workflow collections > Public records
Institute Collections > JSC
Publications database
Open Access

 Record created 2022-12-13, last modified 2022-12-21


OpenAccess:
08-aherten-cooperative-groups - Download fulltext PDF
06-haghighi-CUDA_Streams_and_Events - Download fulltext PDF
05-jkraus-Multi_GPU_Programming_with_MPI_and_CUDA - Download fulltext PDF
02-mhrywniak-cuda_tools - Download fulltext PDF
09-jmeinke-CUDA-C++ - Download fulltext PDF
10-jmeinke-CUB_intro - Download fulltext PDF
03-haghighi-Matrix_Multiplication - Download fulltext PDF
11-haghighi-CudaFortran - Download fulltext PDF
01-aherten-cuda-intro - Download fulltext PDF
07-haghighi-CUDA_DGEMM_Tiled - Download fulltext PDF
04-mhrywniak-perf_opt - Download fulltext PDF
External links:
Download fulltextFulltext
Download fulltextFulltext by OpenAccess repository
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)