Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels

21 Oct 2019 · Jan Laukemann, Julian Hammer, Georg Hager, Gerhard Wellein

Useful models of loop kernel runtimes on out-of-order architectures require an analysis of the in-core performance behavior of instructions and their dependencies. While an instruction throughput prediction sets a lower bound on the kernel runtime, the critical path defines an upper bound...
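
The two bounds named in the abstract can be illustrated with a toy calculation. The sketch below is not the authors' tool or algorithm; it assumes a hypothetical three-instruction loop body with made-up latencies and port occupancies (the names `KERNEL`, `throughput_bound`, and `critical_path_bound` are illustrative only). It takes the throughput bound to be the load on the busiest execution port and the critical-path bound to be the longest latency-weighted dependency chain within one iteration.

```python
from collections import defaultdict

# Hypothetical loop body: name -> (latency in cycles,
#                                  {port: cycles of port occupancy},
#                                  names of instructions it depends on).
# All numbers are invented for illustration, not taken from a real machine model.
KERNEL = {
    "load": (4, {"P2": 1.0}, []),
    "mul":  (4, {"P0": 1.0}, ["load"]),
    "add":  (4, {"P1": 1.0}, ["mul"]),
}


def throughput_bound(kernel):
    """Lower bound: cycles per iteration demanded of the busiest port."""
    pressure = defaultdict(float)
    for _latency, ports, _deps in kernel.values():
        for port, cycles in ports.items():
            pressure[port] += cycles
    return max(pressure.values())


def critical_path_bound(kernel):
    """Upper bound: longest latency-weighted dependency chain in one iteration."""
    memo = {}

    def finish(name):
        # Earliest completion time of `name` if every dependency must finish first.
        if name not in memo:
            latency, _ports, deps = kernel[name]
            memo[name] = latency + max((finish(d) for d in deps), default=0)
        return memo[name]

    return max(finish(name) for name in kernel)


lower = throughput_bound(KERNEL)     # busiest port is occupied 1 cycle per iteration
upper = critical_path_bound(KERNEL)  # load -> mul -> add chain: 4 + 4 + 4 cycles
print(f"throughput bound: {lower} cy/it, critical path: {upper} cy/it")
```

In this toy case the instructions use different ports, so the throughput bound is small, while the serial dependency chain makes the critical path much longer; a measured runtime per iteration would be expected to fall between the two values.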
