[v2,0/6] perf/x86/amd: Add memory controller events

Message ID cover.1696425185.git.sandipan.das@amd.com

Sandipan Das Oct. 5, 2023, 5:23 a.m. UTC
Unified Memory Controller (UMC) events were introduced with Zen 4 as
part of the Performance Monitoring Version 2 (PerfMonV2) enhancements.
Currently, Zen 4 supports up to 12 channels of DDR5 memory, each of
which is controlled by a dedicated UMC. Each UMC, in turn, has its own
set of performance monitoring counters. These counters report UMC
command activity, which can be used to derive utilization and
bandwidth. Using the perf tool, users can profile activity either on a
combined basis (across all active UMCs) or for individual UMCs.

E.g. measurement across all UMCs

  $ sudo perf stat -e amd_umc/umc_cas_cmd.all/ -a -- sleep 1

   Performance counter stats for 'system wide':

	     544,810      amd_umc/umc_cas_cmd.all/

	 1.002012663 seconds time elapsed

E.g. measurement specific to certain UMCs

  $ sudo perf stat -e amd_umc_0/umc_cas_cmd.all/ -e amd_umc_4/umc_cas_cmd.all/ -a -- sleep 1

   Performance counter stats for 'system wide':

	      27,096      amd_umc_0/umc_cas_cmd.all/
	      35,136      amd_umc_4/umc_cas_cmd.all/

	 1.001602807 seconds time elapsed
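
As a rough illustration of how a CAS command count can be turned into a
bandwidth figure, here is a minimal sketch. It assumes that each CAS
command transfers 64 bytes, which is not stated in this cover letter, and
umc_bandwidth_mbps is a hypothetical helper name used only for this
example.

```shell
# Sketch: estimate memory bandwidth (MB/s) from a UMC CAS command count.
# Assumption: every CAS command moves 64 bytes.
umc_bandwidth_mbps() {
    # $1 = CAS command count, $2 = elapsed time in seconds
    awk -v cas="$1" -v secs="$2" \
        'BEGIN { printf "%.2f\n", (cas * 64) / 1e6 / secs }'
}

# Count and elapsed time from the system-wide example (commas removed).
umc_bandwidth_mbps 544810 1.002012663
```

Under that assumption, the system-wide measurement corresponds to
roughly 34.8 MB/s on an otherwise idle machine.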

The available UMCs can be found in sysfs, and the socket to which each
one belongs can be derived from its cpumask.

E.g.

  $ find /sys/bus/event_source/devices/ -maxdepth 1 -name "amd_umc_*" | sort

  /sys/bus/event_source/devices/amd_umc_0
  /sys/bus/event_source/devices/amd_umc_1
  /sys/bus/event_source/devices/amd_umc_2
  /sys/bus/event_source/devices/amd_umc_3
  /sys/bus/event_source/devices/amd_umc_4
  /sys/bus/event_source/devices/amd_umc_5
  /sys/bus/event_source/devices/amd_umc_6
  /sys/bus/event_source/devices/amd_umc_7

  $ cat /sys/devices/amd_umc_0/cpumask
  0

  $ cat /sys/devices/amd_umc_4/cpumask
  96

All of the output above comes from a dual-socket Genoa system with
96 cores and 4 populated memory channels per socket.
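
The cpumask lookup can be scripted. The sketch below assumes that each
cpumask file holds a single CPU number, as in the output above, and that
the socket is read from that CPU's topology/physical_package_id file in
sysfs. umc_sockets is a hypothetical helper; its optional sysfs root
argument exists only to make it easy to try against a test tree.

```shell
# Sketch: list each UMC PMU with the CPU in its cpumask and that CPU's socket.
# Assumptions: one CPU number per cpumask file; socket taken from
# topology/physical_package_id.
umc_sockets() {
    sysfs="${1:-/sys}"
    for dev in "$sysfs"/bus/event_source/devices/amd_umc_*; do
        # Skip when no UMC PMUs exist (the glob stays unexpanded).
        [ -r "$dev/cpumask" ] || continue
        cpu=$(cat "$dev/cpumask")
        pkg=$(cat "$sysfs/devices/system/cpu/cpu$cpu/topology/physical_package_id")
        echo "$(basename "$dev"): cpu $cpu, socket $pkg"
    done
}

umc_sockets
```

On a system without UMC PMUs, this prints nothing.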

Previous versions can be found at:
v1: https://lore.kernel.org/all/cover.1689748843.git.sandipan.das@amd.com/

Changes in v2:
 - Move collection of PMU CPUID info to startup of UNCORE_STARTING.
 - Remove mechanism to read CPUID information using SMP callbacks.
 - Defer PMU registration to startup of UNCORE_ONLINE since this can
   only be done after collection of CPUID information.
 - Remove mechanism to collect and free up unused uncore contexts as
   this is no longer required.
 - Rename some structures (amd_uncore is now called amd_uncore_pmu and
   amd_uncore is instead a collection of amd_uncore_pmu instances).
 - Add new uncore management handlers (scan, init, move, free) which are
   called at different stages of CPU hotplug.
 - Add Acked-by from Ian Rogers for the JSON events.

Sandipan Das (6):
  perf/x86/amd/uncore: Refactor uncore management
  perf/x86/amd/uncore: Move discovery and registration
  perf/x86/amd/uncore: Use rdmsr if rdpmc is unavailable
  perf/x86/amd/uncore: Add group exclusivity
  perf/x86/amd/uncore: Add memory controller support
  perf vendor events amd: Add Zen 4 memory controller events

 arch/x86/events/amd/uncore.c                  | 1036 +++++++++++------
 arch/x86/include/asm/msr-index.h              |    4 +
 arch/x86/include/asm/perf_event.h             |    9 +
 .../arch/x86/amdzen4/memory-controller.json   |  101 ++
 .../arch/x86/amdzen4/recommended.json         |   84 ++
 tools/perf/pmu-events/jevents.py              |    2 +
 6 files changed, 879 insertions(+), 357 deletions(-)
 create mode 100644 tools/perf/pmu-events/arch/x86/amdzen4/memory-controller.json