Online: Node Level Performance Optimization
This course includes topics on code optimization for x86 platforms (Intel and AMD CPUs) and efficient code parallelization on node-level using OpenMP threading. Advanced aspects of threading and optimization, such as selected topics of new features in OpenMP 5.0 standard, will be covered during the course.
Learning outcome
- Awareness of modern features in x86 CPUs;
- Ability to vectorize computations;
- Ability to use advanced features of OpenMP;
- Ability to increase code performance using threading and x86 optimization.
Prerequisites and content level
- Good knowledge of C/C++ or Fortran;
- Good knowledge of threading using OpenMP;
- Basic knowledge of modern CPU architectures.
The content level of the course is broken down as: beginner’s – 0%, intermediate – 20%, advanced – 80%, community-targeted content – 0%.
Agenda
Day 1, Tuesday 18 May 2021
09:00 – 12:30
– Introduction to performance engineering
– Overview of modern multicore CPUs
12:30 – 13:30 Lunch break
13:30 – 16:30
– Intel performance analysis tools
– AMD performance analysis tools
Day 2, Wednesday 19 May 2021
09:00 – 12:00
– Vectorization
12:00 – 13:00 Lunch break
13:00 – 16:30
– Optimizing memory access
– Other single core optimization topics
Day 3, Thursday 20 May 2021
09:00 – 12:00
– OpenMP tasks
12:00 – 13:00 Lunch break
13:00 – 16:30
– Threading optimization
– Future outlook on OpenMP
Lecturers:
Jussi Enkovaara (CSC), Mikko Byckling (Intel), Michael Klemm (AMD)
Language: English
Price: Free of charge (3 training days)
For further detailed information and registration please visit:
https://events.prace-ri.eu/e/NodeLevelPerformanceOptimization_2021
REGISTRATION CLOSED
If you have registered to this course and you are not able to attend, please CANCEL your registration in advance by sending an email to patc@csc.fi
Time
18.5.2021 - 20.5.2021
09:00 - 16:30