TDDD56 Multicore and GPU Programming
Timetable and Lecture Plan
Schedule (as available on the LiU schedule server)
Certain lecture notes and other handouts with restricted access are located here.
The lecture notes and other material may be updated during the course as appropriate.
Lecturers: Christoph Kessler (CK), Ingemar Ragnemalm (IR).
Assistants: August Ernstsson (AE), Lu Li (LL), Ingemar Ragnemalm (IR).
Those lectures marked by asterisks overlap fully (**) or partly (*) with similar lectures in TDDC78 Programming of Parallel Computers - Methods and Tools. These lectures are optional for those who have already taken TDDC78, but might be a useful repetition anyway. This repetition of the common core topics is necessary to allow you to take the courses individually or in arbitrary order.
- Lecture 1:
Motivation: The Multicore Challenge. Multicore Architecture Concepts. (CK)
- Lecture 2: (**)
Shared memory architecture concepts and performance issues. (CK)
- Lecture 3:
Parallel Programming with Threads and Tasks. (CK)
- Lecture 3 (cont., 45min) (CK) +
Lesson 1 (45min):
Introduction to CPU laboratory work. (AE)
- Lecture 4:
Non-blocking synchronization. (CK)
- Lecture 5: (*)
Theory: Parallel programming and cost models. Analysis of parallel algorithms.
- Lecture 6:
Theory (cont.): (*) Speedup anomalies. Amdahl's Law. Fundamental parallel algorithms: parallel prefix sums, parallel list ranking. (CK)
Parallel sorting algorithms: Simple parallel quicksort. (CK)
- Lecture 7:
Parallel sorting algorithms (cont.): Samplesort, Fully parallel quicksort, Bitonic sort, Parallel Mergesort. (CK)
- Lecture 8: (45min)
Parallel algorithmic design patterns: Towards skeleton programming. (CK)
Lesson 2: (45min)
Introduction to skeleton programming in SkePU, and to CPU Lab 3. (AE)
- Lecture 9:
GPU architecture and trends. (IR)
- Lecture 10:
Introduction to CUDA programming. (IR)
- Lecture 11:
CUDA programming. GPU lab introduction. (IR)
- Lecture 12:
Sorting on GPU. Advanced CUDA issues. (IR)
- Lecture 13:
Introduction to OpenCL. (IR)
- Lesson 3:
OpenCL. Shader programming.
Selected exercises. (IR)
- Lesson 4:
Selected CPU/theory exercises. (AE)
Please solve the suggested exercises in advance to be prepared. The summary of important definitions may help as a reminder of the theory.
- Lecture 14: (**)
Parallelization of sequential programs. (CK)
Multicore Programming Contest Awards. Outlook.
We have three lab groups; see the schedule.
- Group_A (32 students, room Southfork, supervised by August Ernstsson (CPU) and Lu Li (GPU))
- Group_B (16 students, room IDA-Multicore Lab, supervised by Lu Li (CPU) and Ingemar Ragnemalm (GPU))
- Group_C (16 students, room IDA-Multicore Lab, supervised by August Ernstsson (CPU) and Lu Li (GPU))
Presence in the lab sessions is mandatory.
Deadlines: see the lab page.
Page responsible: Christoph W Kessler
Last updated: 2018-01-04