Difference between revisions of "ALUSaturation"
Jump to navigation
Jump to search
(Created page with "== Description == The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. == Symptoms == * Throughput...") |
m |
||
| (One intermediate revision by one other user not shown) | |||
| Line 1: | Line 1: | ||
| + | [[Category:Performance Pattern]] | ||
== Description == | == Description == | ||
The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. | The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core. | ||
| Line 9: | Line 10: | ||
== Detection == | == Detection == | ||
Use a hardware-counter tool like: | Use a hardware-counter tool like: | ||
| − | * LIKWID with performance groups FLOPS_SP, | + | * LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK |
| + | * Same information can be provided by perf or PAPI | ||
== Possible optimizations and/or fixes == | == Possible optimizations and/or fixes == | ||
Latest revision as of 15:13, 3 September 2019
Description
The pattern "ALU saturation" describes the performance limitation caused by fully utilizing a functional unit inside a CPU core.
Symptoms
- Throughput at design limit(s)
- Good (low) CPI
- Integral ratio of cycles to specific instruction count(s).
Detection
Use a hardware-counter tool like:
- LIKWID with performance groups FLOPS_SP, FLOPS_DP, DATA and CLOCK
- Same information can be provided by perf or PAPI