Profile DPC++ and GPU Workload Performance

Developers who deploy applications across both CPUs and GPUs are often challenged to find the best methods for analyzing and optimizing offload performance.

In this webinar, technical consulting engineer Vladimir Tsymbal demonstrates how to do so using the Intel® oneAPI beta version of Intel® VTune™ Profiler, a performance analysis tool that takes the guesswork out of cross-architecture performance improvements.

Using a sample application written in Data Parallel C++ (DPC++), Vladimir demonstrates how Intel® VTune™ Profiler (beta) can be used to:

  • Profile DPC++ code running on both host and GPU processors
  • Collect the right data and turn it into rich, easily interpretable analysis
  • Identify the hotspots in your compute kernels and determine which are the key areas for optimization
  • Show how the GPU resources are being utilized and locate hardware bottlenecks

Technical Level: Intermediate

Get started with oneAPI

More Intel® VTune™ Profiler (beta) resources

Vladimir Tsymbal, Senior Technical Consulting Engineer, Intel Corporation

Vladimir Tsymbal is a senior technical consulting engineer who specializes in teaching customers how to use a variety of Intel® Software Tools to develop, tune, and optimize their parallel applications on Intel® Architecture. In particular, he focuses on the Intel® Parallel Studio XE product suite and the analysis tools it contains, including Intel® VTune™ Profiler (which he helped develop), Intel® Advisor, and Intel® Inspector.

Prior to joining Intel in 2005, Vladimir worked as a research assistant and developed hardware graphics accelerators as well as software and hardware systems for medical diagnostics. He holds a PhD in Mathematics and Computer Science from Taganrog State University of Radio Engineering, Russia.
