Introduction to High-Performance Computing
from Monday 19 August 2013 (08:00) to Friday 30 August 2013 (15:00)
Monday 19 August 2013
08:30 - 09:15
Registration
Room: E3
09:15 - 10:15
Welcome to PDC and the Summer School - Erwin Laure (PDC-HPC)
Room: E3
10:15 - 10:30
Coffee break
10:30 - 12:15
Concepts and Algorithms for Scientific Computing - Björn ENGQUIST
Room: E3
12:15 - 14:00
Picnic
14:00 - 15:00
Introduction to PDC's Environment - Izhar UL HASSAN
Room: E3
15:00 - 15:15
Coffee break
15:15 - 16:45
Lab: Introduction to PDC's Environment - Izhar UL HASSAN
Room: 4V2Röd, 4V3Ora
Tuesday 20 August 2013
09:15 - 10:00
High-Performance Computer Architecture - Erik HAGERSTEN
Room: E3
10:00 - 10:15
Coffee break
10:15 - 12:00
High-Performance Computer Architecture - Erik HAGERSTEN
12:00 - 13:00
Individual lunch
13:00 - 15:00
High-Performance Computer Architecture - Erik HAGERSTEN
Room: E3
15:00 - 15:15
Coffee break
15:15 - 17:00
Lab: Introduction to PDC's Environment - Izhar UL HASSAN
Room: 4V2Röd, 4V3Ora
15:15 - 17:00
PDC Machine Room Tour
Wednesday 21 August 2013
09:15 - 10:00
Shared memory programming, OpenMP - Thomas ERICSSON
Two common ways of speeding up programs are to use processes or threads executing in parallel. The second week of the Summer School presents process-based parallel computing using MPI; this lecture discusses parallelization using threads. Threads can be managed in different ways, e.g. using a low-level library such as POSIX threads. The lecture deals with OpenMP, which is a more convenient way to use threads. OpenMP is a specification for a set of compiler directives, library routines, and environment variables that can be used to specify shared-memory parallelism in Fortran and C/C++ programs. The threaded program can be executed on a shared-memory computer (e.g. a multi-core processor). Typically, the iterations of a time-consuming loop are shared among a team of threads, each thread working on part of the iterations. The loop is parallelized by writing a directive in the code, and a compiler or preprocessor generates the parallel code. Using short code examples, the lecture covers the most important OpenMP constructs, how to use them and how not to use them. A few more realistic examples are presented as well.
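To make the loop-sharing idea concrete, here is a minimal sketch in C (an illustration assumed for this summary, not taken from the lecture notes): a single OpenMP directive divides the iterations of a loop among a team of threads.

    /* Minimal OpenMP sketch: one directive parallelizes the loop below.
     * Illustrative only; compile e.g. with `gcc -fopenmp example.c`. */
    #include <stdio.h>
    #include <omp.h>

    #define N 1000000

    int main(void)
    {
        static double x[N], y[N];
        const double a = 2.0;

        for (int i = 0; i < N; i++) {   /* initialise the vectors */
            x[i] = (double)i;
            y[i] = 1.0;
        }

        /* The iterations are divided among the threads in the team;
         * the loop index i is private to each thread by default. */
        #pragma omp parallel for
        for (int i = 0; i < N; i++)
            y[i] = a * x[i] + y[i];

        printf("y[42] = %.1f (up to %d threads)\n", y[42], omp_get_max_threads());
        return 0;
    }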
10:00 - 10:15
Coffee break
10:15 - 12:00
Shared memory programming, OpenMP - Thomas ERICSSON
Room: E3
12:00 - 13:00
Individual lunch
13:15 - 14:00
Shared memory programming, OpenMP - Thomas ERICSSON
Room: E3
14:15 - 15:00
Lab: Programming Exercises on OpenMP - Niclas JANSSON, Stefano MARKIDIS
Room: 4V2Röd, 4V3Ora
15:00 - 15:15
Coffee break
15:15 - 16:00
Lab: Programming Exercises on OpenMP - Stefano MARKIDIS, Niclas JANSSON
Room: 4V2Röd, 4V3Ora
Thursday 22 August 2013
09:15 - 10:00
Code optimization - Thomas ERICSSON
Room: E3
In this lecture, techniques for speeding up code on a uniprocessor are presented. The lecture covers basic optimization principles, such as using the memory hierarchy in an efficient way. Using short code examples, it covers stride minimization, blocking, loop fusion, loop splitting and other approaches. The choice of programming language (Fortran or C) and compiler is discussed, and there are some hints on how to tune Matlab programs. Other topics are vector arithmetic for floating point and elementary functions, virtual memory, files, and the LAPACK and BLAS libraries.
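As a hedged illustration of one of the listed techniques, stride minimization, the following C sketch (assumed for this summary, not from the lecture material) contrasts two traversal orders of a row-major matrix; the unit-stride version typically runs much faster because each cache line is used fully.

    /* Stride minimization sketch: C stores matrices row-major, so the
     * inner loop should run over the last index (unit stride). */
    #include <stdio.h>

    #define N 2000

    static double a[N][N];

    double sum_colwise(void)        /* stride N between accesses: slow */
    {
        double s = 0.0;
        for (int j = 0; j < N; j++)
            for (int i = 0; i < N; i++)
                s += a[i][j];
        return s;
    }

    double sum_rowwise(void)        /* unit stride: cache friendly, fast */
    {
        double s = 0.0;
        for (int i = 0; i < N; i++)
            for (int j = 0; j < N; j++)
                s += a[i][j];
        return s;
    }

    int main(void)
    {
        for (int i = 0; i < N; i++)
            for (int j = 0; j < N; j++)
                a[i][j] = 1.0;
        printf("%.0f %.0f\n", sum_colwise(), sum_rowwise());
        return 0;
    }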
10:00 - 10:15
Coffee break
10:15 - 12:00
Code optimization - Thomas ERICSSON
Room: E3
12:00 - 13:00
Individual lunch
13:15 - 15:00
Lab: OpenMP Advanced Project - Niclas JANSSON, Stefano MARKIDIS
Room: 4V2Röd, 4V3Ora
15:00 - 15:15
Coffee break
15:15 - 17:00
Lab: OpenMP Advanced Project - Niclas JANSSON, Stefano MARKIDIS
Room: 4V2Röd, 4V3Ora
Friday 23 August 2013
08:30 - 09:00
Wrap up: OpenMP Advanced Project - Stefano MARKIDIS, Niclas JANSSON
Room: E3
09:15 - 10:00
GPU Architectures for Non-Graphics People - David BLACK-SCHAFFER
Room: E3
Today everyone is positioning GPUs for general-purpose computing. They claim that you can get 10-100x speedups over conventional CPUs, and sometimes they're even right. However, to get the most out of current- (and next-) generation GPUs, one needs to understand the architectural differences and how they affect your choice of algorithm. In this talk I will cover GPU architecture in comparison to current CPUs, discuss the implications for getting good performance, and introduce OpenCL as a general-purpose programming language for accessing GPUs and CPUs today.
10:00 - 10:15
Coffee break
10:15 - 11:00
GPUs: The Hype, The Reality, and The Future - David Black-Schaffer
Room: E3
We'll take a look at the promise and the results of porting applications to GPUs and try to distinguish the hype from the reality. We'll look more closely at where GPUs fit into HPC today and in the near future, attempt a fair comparison of GPU-optimized and CPU-optimized programs, and also consider competing accelerators such as the Intel Xeon Phi. The goal of this lecture is to give you some perspective on GPU technology and where it may (or may not) fit into HPC in the next few years.
11:00 - 12:00
Introduction to CUDA - Michael SCHLIEPHAKE
Room: E3
CUDA is a parallel computing platform and programming model for NVIDIA GPU devices that makes it possible to use the capabilities of the hardware efficiently. The lecture and lab give an overview of CUDA, discuss the aspects of programming that need the programmer's attention in order to reach the desired performance, and provide a first hands-on experience.
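As a taste of what the hands-on part involves, the following CUDA C sketch (an assumed example, not the actual lab code) shows the basic offload pattern: copy data to the device, launch a kernel over a grid of threads, and copy the result back.

    /* Minimal CUDA C sketch of the offload model: one GPU thread per
     * vector element. Compile with `nvcc vecadd.cu`. Illustrative only. */
    #include <stdio.h>
    #include <stdlib.h>
    #include <cuda_runtime.h>

    __global__ void vecadd(const float *x, const float *y, float *z, int n)
    {
        int i = blockIdx.x * blockDim.x + threadIdx.x;  /* global thread index */
        if (i < n)
            z[i] = x[i] + y[i];
    }

    int main(void)
    {
        const int n = 1 << 20;
        size_t bytes = n * sizeof(float);
        float *hx = (float *)malloc(bytes), *hy = (float *)malloc(bytes),
              *hz = (float *)malloc(bytes);
        for (int i = 0; i < n; i++) { hx[i] = 1.0f; hy[i] = 2.0f; }

        float *dx, *dy, *dz;                 /* device copies of the vectors */
        cudaMalloc(&dx, bytes); cudaMalloc(&dy, bytes); cudaMalloc(&dz, bytes);
        cudaMemcpy(dx, hx, bytes, cudaMemcpyHostToDevice);
        cudaMemcpy(dy, hy, bytes, cudaMemcpyHostToDevice);

        /* 256 threads per block, enough blocks to cover all n elements. */
        vecadd<<<(n + 255) / 256, 256>>>(dx, dy, dz, n);
        cudaMemcpy(hz, dz, bytes, cudaMemcpyDeviceToHost);

        printf("z[0] = %.1f\n", hz[0]);
        cudaFree(dx); cudaFree(dy); cudaFree(dz);
        free(hx); free(hy); free(hz);
        return 0;
    }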
12:00 - 13:00
Individual lunch
13:15 - 14:15
Lab: CUDA - Michael SCHLIEPHAKE
Room: 4V2Röd, 4V3Ora
14:15 - 15:00
Introduction to CUDA - Michael SCHLIEPHAKE
Room: E3
15:00 - 15:15
Coffee break
15:15 - 16:00
Introduction to CUDA - Michael SCHLIEPHAKE
Room: E3
16:00 - 17:00
Lab: CUDA - Michael SCHLIEPHAKE
Room: 4V2Röd, 4V3Ora
Saturday 24 August 2013
Sunday 25 August 2013
Monday 26 August 2013
09:15 - 10:00
MPI Introduction - Erwin LAURE
Room: E3
10:00 - 10:15
Coffee break
10:15 - 12:00
MPI Basic Concepts & Point-to-Point Communication - Erwin LAURE
Room: E3
12:00 - 13:00
Individual lunch
13:15 - 15:00
Lab: MPI Part 1 - Jonathan VINCENT
Room: 4V2Röd, 4V3Ora
15:00 - 15:15
Coffee break
15:15 - 17:00
Lab: MPI Part 1 - Jonathan VINCENT
Room: 4V2Röd, 4V3Ora
Tuesday 27 August 2013
09:15 - 10:00
MPI: Collective Communication - Erwin LAURE
Room: E3
10:00 - 10:15
Coffee break
10:15 - 11:00
MPI: Intermediate MPI - Erwin LAURE
Room: E3
11:15 - 12:00
Performance Engineering - Pekka MANNINEN
Room: E3
12:00 - 13:00
Individual lunch
13:15 - 14:00
Performance Engineering - Pekka MANNINEN
Room: E3
14:15 - 15:00
Lab: Performance Engineering - Pekka MANNINEN
Room: 4V2Röd, 4V3Ora
15:00 - 15:15
Coffee break
15:15 - 17:00
Lab: Performance Engineering - Pekka MANNINEN
Room: 4V2Röd, 4V3Ora
15:15 - 17:00
Project work
Room: 4V2Röd, 4V3Ora
Discuss your project ideas with other students, the course examiner, and tutors.
Wednesday 28 August 2013
08:30 - 09:00
Wrap up: Performance Engineering - Pekka MANNINEN
Room: E3
09:15 - 10:00
MPI: Advanced Concepts - Erwin LAURE
Room: E3
10:00 - 10:15
Coffee break
10:30 - 11:15
MPI: Advanced Concepts - Erwin LAURE
11:15 - 12:00
Parallel Performance Engineering - Pekka MANNINEN
Room: E3
12:00 - 13:00
Individual lunch
13:15 - 15:00
Lab: Parallel Performance Engineering - Pekka MANNINEN
Room: 4V2Röd, 4V3Ora
15:00 - 15:15
Coffee break
15:15 - 17:00
Lab: MPI Part 2 - Jonathan VINCENT
Thursday 29 August 2013
09:15 - 10:00
Interconnection Networks - Michael SCHLIEPHAKE
Room: E3
Interconnect networks have a large influence on the performance and efficiency of message-passing and file-I/O operations in cluster systems. The lecture presents concise information about interconnect networks in order to support an understanding of this influence and of how to exploit it in favour of simulation applications.
10:00 - 10:15
Coffee break
10:15 - 11:00
Interconnection Networks - Michael SCHLIEPHAKE
Room: E3
11:15 - 12:00
Case Study: Numerical experiments of turbulent flows on supercomputers - Philipp SCHLATTER (KTH)
Room: E3
After a short description of the activities of the Swedish e-Science Research Centre (SeRC), a general motivation for studying turbulence is given, including the large range of scales that needs to be resolved at high Reynolds numbers. The infrastructure needed to perform these simulations is discussed, including the simulation codes and large-scale computational resources. Examples of such large-scale simulations are then given in terms of high-Reynolds-number turbulent boundary layers on a flat plate, and also such a boundary layer with an obstacle. Finally, the possibility of scaling up the simulations to flows of direct aeronautical interest is discussed, e.g. when will it be possible to replace typical university wind-tunnel experiments with DNS? This raises the question of exascale computing and of whether the current methods can be used on such an e-infrastructure.
12:00 - 13:00
Individual lunch
13:15 - 15:00
Lab: MPI Part 3 - Jonathan VINCENT
Room: 4V2Röd, 4V3Ora
15:00 - 15:15
Coffee break
15:15 - 17:00
Lab: MPI Part 3 - Jonathan VINCENT
Room: 4V2Röd, 4V3Ora
15:15 - 17:00
Project work
18:00 - 21:00
Social dinner
Friday 30 August 2013
09:15 - 10:00
Future Programming Languages - Stefano MARKIDIS
Room: E3
The lecture presents the current limitations of traditional programming models, such as MPI and OpenMP, and provides motivation to adopt new programming models for parallel computing. The potential of innovative programming models such as the Partitioned Global Address Space (PGAS) languages and task-based programming models is discussed. A few examples of how to test these innovative programming models on Lindgren are also presented.
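As a hedged illustration of the PGAS idea, the sketch below uses UPC (Unified Parallel C), one PGAS language; it is an assumed example and not necessarily one of the models demonstrated on Lindgren.

    /* PGAS sketch in UPC. Arrays declared `shared` live in a partitioned
     * global address space: any thread may access any element, but each
     * element has an owning thread. Compile with a UPC compiler, e.g.
     * `upcc vecadd.upc`. Illustrative only. */
    #include <stdio.h>
    #include <upc.h>

    #define N 1024

    shared double x[N], y[N], z[N];

    int main(void)
    {
        int i;

        /* upc_forall distributes iterations by affinity: the thread that
         * owns &z[i] executes iteration i, so most accesses stay local. */
        upc_forall (i = 0; i < N; i++; &z[i])
            z[i] = x[i] + y[i];

        upc_barrier;                     /* wait for all threads */
        if (MYTHREAD == 0)
            printf("done on %d threads\n", THREADS);
        return 0;
    }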
10:00 - 10:15
Coffee break
10:15 - 11:00
Future Programming Languages - Stefano MARKIDIS
Room: E3
11:15 - 12:00
Abstractions for processing large data sets - Jonas YNGVESSON
Room: E3
At Google there is often a need to process very large data sets across many machines. Building efficient parallel processing programs is not trivial, and to avoid the overhead of each engineer reinventing the wheel, Google has created several programming models that abstract away the complexities of parallelism and let the programmer concentrate on the core of the processing problem instead. I will talk mainly about the MapReduce model: how it works and the conceptual model the programmer works with when using it. I will also mention a bit about a more recent framework, Pregel, which is built for processing very large graphs.
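To make the conceptual model concrete, here is a minimal single-process word-count sketch in C (assumed for this summary; the emit_intermediate interface is a stand-in for the framework, not Google's actual API). The user writes map and reduce steps; the real framework handles distribution, grouping, and fault tolerance across machines.

    /* Single-process MapReduce sketch (word count). The tiny table-based
     * "framework" below stands in for the shuffle/group phase of a real
     * distributed runtime; it also pre-combines counts per key. */
    #include <stdio.h>
    #include <string.h>

    #define MAX_KEYS 1024

    static struct { char key[64]; long count; } table[MAX_KEYS];
    static int nkeys = 0;

    /* Framework stand-in: collect an intermediate (key, value) pair,
     * combining values for equal keys. */
    static void emit_intermediate(const char *key, long value)
    {
        for (int i = 0; i < nkeys; i++)
            if (strcmp(table[i].key, key) == 0) { table[i].count += value; return; }
        if (nkeys < MAX_KEYS) {
            snprintf(table[nkeys].key, sizeof table[nkeys].key, "%s", key);
            table[nkeys].count = value;
            nkeys++;
        }
    }

    /* User-supplied map(): emit (word, 1) for every word in the input. */
    static void map(char *text)
    {
        for (char *w = strtok(text, " \t\n"); w != NULL; w = strtok(NULL, " \t\n"))
            emit_intermediate(w, 1);
    }

    /* User-supplied reduce(): report the combined count for each key. */
    static void reduce(void)
    {
        for (int i = 0; i < nkeys; i++)
            printf("%s\t%ld\n", table[i].key, table[i].count);
    }

    int main(void)
    {
        char text[] = "the quick brown fox jumps over the lazy dog the end";
        map(text);
        reduce();
        return 0;
    }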
12:00 - 12:15
Individual lunch
13:00 - 17:00
Open Lab and Project Work
Room: 4V2Röd, 4V3Ora