From patchwork Mon Nov 14 07:41:54 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jing Zhang X-Patchwork-Id: 1589 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:6687:0:0:0:0:0 with SMTP id l7csp2017264wru; Sun, 13 Nov 2022 23:43:26 -0800 (PST) X-Google-Smtp-Source: AA0mqf4CuKcH+x6H2pUsx4bmDFPdJp3gjfvPDYEs61MQHqV1n6DCCVtewoFl67dvymsNT3zpF+xA X-Received: by 2002:a17:906:4e16:b0:7ae:72ae:1f85 with SMTP id z22-20020a1709064e1600b007ae72ae1f85mr9015391eju.133.1668411806618; Sun, 13 Nov 2022 23:43:26 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1668411806; cv=none; d=google.com; s=arc-20160816; b=hgtCnCMA9Tte1F92rgUphlKL71fEjtXxsPham02fpwZuHCm9WU2enqn444qbuAGTeC EYrVU0HNp5t5tVT/TpeDEPmRfyRV0IkWZ41ZaPPL5WJyJct734wZlDugIRztSGD9CuAy IRuNNJ6CeSahwep2sQuIAQNPbcMWokZcQio/2aX1WwQRPQTGjg6KorWNMDTFFm0q0abJ lkdFGXR3A54BOE8bG06/4oLs8E9WBxyyJ/3w9R02d1yHUWRkvFseu+1hEGvdYodDuC03 VKEx6wDrVYBqlk/KiWMPcKv84uA51ur9z1cHBGCMwXrUWT8wopAHWDls9pBkVIvZzj6I JXZw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:references:in-reply-to:message-id:date:subject :cc:to:from; bh=mmisO2oL5nLFs/ptZUR2GLY5cdq2Q6K9KWhsRZvjGsw=; b=vD1shHmNlDj5dov51HA/q6BVMA8c/cbHsbOX1ne4bjaGemgr6VGPN4sn4DnWNR0dYl V4JYAIaddUPlpXd9FX/PhtJ057onan4yi0ChmveUwCA5GNMyM1rq9UeE1TRKlaKgQkLD ajup1iT/lv/hYH+vlKojBMxk697HGrHh0hONxtaHaxyjp/QtQNJXZCLi/AlepSTQpV0p SoIe7oLh9BP4QV13wpsksDXYzj7xxzLyHlhREgrMQPShIxTtPAjqxzzjvtpRB7tTx9GU 76QYMgT55gWI9xB4Iwn8bzdKUM4sOwTpKhjE0eV0EGaJcdPES60FafYR1x6zC7Xnc9fl 2Oaw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id sg38-20020a170907a42600b00780076c3322si8619874ejc.432.2022.11.13.23.43.03; Sun, 13 Nov 2022 23:43:26 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235950AbiKNHmp (ORCPT + 99 others); Mon, 14 Nov 2022 02:42:45 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37610 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235617AbiKNHml (ORCPT ); Mon, 14 Nov 2022 02:42:41 -0500 Received: from out30-43.freemail.mail.aliyun.com (out30-43.freemail.mail.aliyun.com [115.124.30.43]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1320A12778; Sun, 13 Nov 2022 23:42:39 -0800 (PST) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R611e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018045170;MF=renyu.zj@linux.alibaba.com;NM=1;PH=DS;RN=19;SR=0;TI=SMTPD_---0VUjWhzm_1668411748; Received: from j66e01291.sqa.eu95.tbsite.net(mailfrom:renyu.zj@linux.alibaba.com fp:SMTPD_---0VUjWhzm_1668411748) by smtp.aliyun-inc.com; Mon, 14 Nov 2022 15:42:36 +0800 From: Jing Zhang To: linux-arm-kernel@lists.infradead.org, linux-perf-users@vger.kernel.org, linux-kernel@vger.kernel.org, John Garry , Will Deacon , James Clark , Mike Leach , Leo Yan Cc: Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Andrew Kilroy , Shuai Xue , Zhuo Song , Jing Zhang Subject: [RFC PATCH v2 0/6] Add metrics for neoverse-n2 Date: Mon, 14 Nov 2022 15:41:54 +0800 Message-Id: <1668411720-3581-1-git-send-email-renyu.zj@linux.alibaba.com> X-Mailer: git-send-email 1.8.3.1 In-Reply-To: <1667214694-89839-1-git-send-email-renyu.zj@linux.alibaba.com> References: <1667214694-89839-1-git-send-email-renyu.zj@linux.alibaba.com> X-Spam-Status: No, score=-9.9 required=5.0 tests=BAYES_00, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2, SPF_HELO_NONE,SPF_PASS,UNPARSEABLE_RELAY,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1748201417459716951?= X-GMAIL-MSGID: =?utf-8?q?1749456578560533494?= Changes since v1: - Corrected formula for topdown L1 due to wrong counts for stall_slot and stall_slot_frontend; - Link: https://lore.kernel.org/all/1667214694-89839-1-git-send-email-renyu.zj@linux.alibaba.com/ This series add six metricgroups for neoverse-n2, among which, the formula of topdown L1 is from the document: https://documentation-service.arm.com/static/60250c7395978b529036da86?token= Due to the wrong count of stall_slot and stall_slot_frontend in neoverse-n2, the real stall_slot and real stall_slot_frontend need to subtract cpu_cycles, so when calculating the topdownL1 metrics, stall_slot and stall_slot_frontend are corrected. Since neoverse-n2 does not yet support topdown L2, metricgroups such as Cache, TLB, Branch, InstructionsMix, and PEutilization are added to help further analysis of performance bottlenecks. with this series on neoverse-n2: $./perf list ... Metric Groups: Branch: branch_miss_pred_rate [The rate of branches mis-predited to the overall branches] branch_mpki [The rate of branches mis-predicted per kilo instructions] branch_pki [The rate of branches retired per kilo instructions] Cache: l1d_cache_miss_rate [The rate of L1 D-Cache misses to the overall L1 D-Cache] l1d_cache_mpki [The rate of L1 D-Cache misses per kilo instructions] ... $sudo ./perf stat -a -M TLB sleep 1 Performance counter stats for 'system wide': 35,861,936 L1I_TLB # 0.00 itlb_walk_rate (74.91%) 5,661 ITLB_WALK (74.91%) 97,279,240 INST_RETIRED # 0.07 itlb_mpki (74.91%) 6,851 ITLB_WALK (74.91%) 26,391 DTLB_WALK # 0.00 dtlb_walk_rate (75.07%) 35,585,545 L1D_TLB (75.07%) 85,923,244 INST_RETIRED # 0.35 dtlb_mpki (75.11%) 29,992 DTLB_WALK (75.11%) 1.003450755 seconds time elapsed $sudo ./perf stat -M TopDownL1 false_sharing 2 Performance counter stats for 'false_sharing 2': 3,388,884,713 cpu_cycles # 0.05 retiring # 0.00 wasted (66.59%) 19,495,064,576 stall_slot (66.59%) 838,235,126 op_spec (66.59%) 836,787,162 op_retired (66.59%) 3,380,520,038 cpu_cycles # 0.29 frontend_bound (67.15%) 8,267,545,049 stall_slot_frontend (67.15%) 3,389,138,804 cpu_cycles # 0.67 backend_bound (66.66%) 11,337,766,816 stall_slot_backend (66.66%) 0.442572628 seconds time elapsed 1.235153000 seconds user 0.000000000 seconds sys Jing Zhang (6): perf vendor events arm64: Add topdown L1 metrics for neoverse-n2 perf vendor events arm64: Add TLB metrics for neoverse-n2 perf vendor events arm64: Add cache metrics for neoverse-n2 perf vendor events arm64: Add branch metrics for neoverse-n2 perf vendor events arm64: Add PE utilization metrics for neoverse-n2 perf vendor events arm64: Add instruction mix metrics for neoverse-n2 .../arch/arm64/arm/neoverse-n2/metrics.json | 247 +++++++++++++++++++++ 1 file changed, 247 insertions(+) create mode 100644 tools/perf/pmu-events/arch/arm64/arm/neoverse-n2/metrics.json