Message ID | 20231104133216.42056-1-xueshuai@linux.alibaba.com |
---|---|
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:8f47:0:b0:403:3b70:6f57 with SMTP id j7csp1651568vqu; Sat, 4 Nov 2023 06:32:44 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEGanmnKlFdaQFmS3n8ved6/7Gdy8/BGU+6GXFAoEKoUDhpzzg57LS7fpvZp8q6MHtvJ49d X-Received: by 2002:a05:6870:f621:b0:1e9:cc21:295f with SMTP id ek33-20020a056870f62100b001e9cc21295fmr3804158oab.16.1699104764636; Sat, 04 Nov 2023 06:32:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1699104764; cv=none; d=google.com; s=arc-20160816; b=wrVhwL1o4mE4MVUSLQdA4+QgvMXCmK2WSOT4k1r4i+5asN8y8pdm89vs4+2/xIB3Sm 4k8ymRXzFS76IcxKXyJkxi8Nyi1p04R3xBtAsj6YG4EhYGEcOUoniD78PsYFvroAMsKp V3MptMvLiodDonzJmfUuGFXZ95Mg7bABj6Ty+PQI+VDvlqHMslKXV382Mvx85QEto0AW TM1I48DhQhboIbGH0UuYJDr5BVi84XLrSYy090wqstrtpUN7LENM+7ZCz/KOx+Uc4HoV WF1I1oVLTU2tgEs0BKqyhglFQqyTiG/DiI/diDSEitPFeq9+YAZwNiUhGMDaNeLi1EvA kGRg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=/AuyXc2nbCipCxQM1/OFeWKYLMSjQG+gwfTeubA0ecI=; fh=JTFo9jksM3iJMk5d1s6bsW43YoyiK0Zjg5tW6S+IgPI=; b=BgaWjicTaCp4s6HBp9ZbtNLyRjWxSL8xJOGDwtFKyqgn3UXog6RojIWGIWnLaz0dSZ 3MMfZJjhoenumZiOHvRnjoBFK+gRMc3cXHh4rfAg6yE+xU/YzUQa6C9AmsLuRcSGKMrm yTcCJsyyu8M88yAQqo85Xafop17B1NtVhRtQ2zpkNvaWWF5eGa/jFmyZTzRhNesI5M4M EZSN//rso/nyjdhausHHithB+rUrOYAzruXBQvygi1rmNTi+twB6d2aatSlEoG0MCqNQ 78SyQv6KSvOvdaUggKphcas0LP5ohZCcTJFrKM7WcWhlLsUiJnLNxD1dg/e1SZT+sl3q tazA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: from snail.vger.email (snail.vger.email. [2620:137:e000::3:7]) by mx.google.com with ESMTPS id qc2-20020a056871480200b001efb15a7e3csi1543732oab.38.2023.11.04.06.32.44 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sat, 04 Nov 2023 06:32:44 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) client-ip=2620:137:e000::3:7; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:7 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by snail.vger.email (Postfix) with ESMTP id 7EB3480408F1; Sat, 4 Nov 2023 06:32:43 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at snail.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232252AbjKDNck (ORCPT <rfc822;lhua1029@gmail.com> + 35 others); Sat, 4 Nov 2023 09:32:40 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56250 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229456AbjKDNcg (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Sat, 4 Nov 2023 09:32:36 -0400 Received: from out30-101.freemail.mail.aliyun.com (out30-101.freemail.mail.aliyun.com [115.124.30.101]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F385ED45; Sat, 4 Nov 2023 06:32:30 -0700 (PDT) X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R391e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018045192;MF=xueshuai@linux.alibaba.com;NM=1;PH=DS;RN=16;SR=0;TI=SMTPD_---0VvcM-NJ_1699104741; Received: from localhost.localdomain(mailfrom:xueshuai@linux.alibaba.com fp:SMTPD_---0VvcM-NJ_1699104741) by smtp.aliyun-inc.com; Sat, 04 Nov 2023 21:32:24 +0800 From: Shuai Xue <xueshuai@linux.alibaba.com> To: kaishen@linux.alibaba.com, helgaas@kernel.org, yangyicong@huawei.com, will@kernel.org, Jonathan.Cameron@huawei.com, baolin.wang@linux.alibaba.com, robin.murphy@arm.com Cc: chengyou@linux.alibaba.com, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-pci@vger.kernel.org, rdunlap@infradead.org, mark.rutland@arm.com, zhuo.song@linux.alibaba.com, xueshuai@linux.alibaba.com, renyu.zj@linux.alibaba.com Subject: [PATCH v10 0/5] drivers/perf: add Synopsys DesignWare PCIe PMU driver support Date: Sat, 4 Nov 2023 21:32:11 +0800 Message-Id: <20231104133216.42056-1-xueshuai@linux.alibaba.com> X-Mailer: git-send-email 2.34.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-9.9 required=5.0 tests=BAYES_00, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,UNPARSEABLE_RELAY,USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (snail.vger.email [0.0.0.0]); Sat, 04 Nov 2023 06:32:43 -0700 (PDT) X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1781640477480069367 X-GMAIL-MSGID: 1781640477480069367 |
Series |
drivers/perf: add Synopsys DesignWare PCIe PMU driver support
|
|
Message
Shuai Xue
Nov. 4, 2023, 1:32 p.m. UTC
Change Log ========== - move the &plat_dev->dev to previous line to warp more beautiful (Per Jonathan) - rename error label with the same suffix 'err' (Per Jonathan) - drop unnecessary else branch and return directly (Per Baolin) - warp out set prev_count from dwc_pcie_pmu_set_period (Per Baolin) - use PMU_FORMAT_ATTR to simplify format sysfs stuff (Per Will) - export pci_clear_and_set_dword() to simplify _enable() functions (Per Will) - simplify _read() function by unconditionally calculate with unit in bytes plused if branch for group#1 event (Per Will and Robin) - simplify _update() function by unconditionally mask with 64-bit width plused if branch for lane event (Per Will) - add type sanity check in _init() (Per Will) - test with fuzzing tool before this new version (Per Will) - register a platform device for each PCI device to probe RAS_DES PMU cap (Per Robin) - add dwc_pcie_vendor_ids to extend vendor id for future added device (Per Krishna) - pickup review-by tag from Baolin, Yicong and Jonathan changes since v8: - tidy up doc and pick up Reviewed-by tag in pathc 1/4 (Per Yicong and Jonathan) - rename ras_des as ras_des_offset and remove coment about it (Per Jonathan) - keep exact reverse order of what happened in probe when unregister pmu (Per Jonathan) - remove the hender length check to make the driver more compatible (Per Jonathan) - move unwind of register pmu forward to fix list_del() on something that was never added (Per Jonathan) - use devm_add_action_or_reset() to unwind bus_register_notifier() (Per Jonathan) - use local pdev directly in dwc_pcie_pmu_offline_cpu() and dwc_pcie_pmu_probe() (Per Jonathan) - use the approach of a gotos to handle errors in dwc_pcie_pmu_init() (Per Jonathan) - multiply the counter value at point of read by 16 for group#1 events (Per Jonathan) changes since v7: - add config help with alibaba name (per Bjorn) - remove the ARM64 dependency (per Bjorn and Jonathan) - fix typo and column warp (per Bjorn) - move list_del() after perf_pmu_unregister() (per Bjorn) - reorder the funtions by interests (per Bjorn) - rewrite commit log about PMU counters, also update doc (per Bjorn) - extend to support stat time-based analysis and lane event at the same time (per Bjorn and Jonathan) Link: https://lore.kernel.org/linux-arm-kernel/20231012032856.2640-2-xueshuai@linux.alibaba.com/T/ changes since v6: - improve editorial things in doc (Per Jonathan) - change config help to generic text (Per Jonathan) - remove macro to_dwc_pcie_pmu by moving pmu as the first member to struct dwc_pcie_pmu (Per Yicong) - add event type check in dwc_pcie_event_show() to keep consistent with other function (Per Jonathan) - remove intended blank line (Per Yicong) - protect against lower 32 bits of counter overflow by try again trick (Per Jonathan) - call pci_dev_put on all the return branch to keep the refcnt balance (Per Jonathan and Yicong) - use devm_add_action_or_reset() to automatic unwind (Per Jonathan) - fix picking numa-aware context cpu up when offline and offline cpu (Per Jonathan) - simplify online cpu by init pcie_pmu->on_cpu as -1 (Per Jonathan) - add bus_register_notifier() to handle rootport hotplug (Per Yicong) - pick up Acked-by from Bjorn for patch 2/4 (Per Bjorn) Link: https://lore.kernel.org/lkml/20230606074938.97724-1-xueshuai@linux.alibaba.com/T/ changes since v5: - Rewrite the commit log to follow policy in pci_ids.h (Bjorn Helgaas) - return error code when __dwc_pcie_pmu_probe failed (Baolin Wang) - call 'cpuhp_remove_multi_state()' when exiting the driver. (Baolin Wang) - pick up Review-by tag from Baolin for Patch 1 and 3 Link: https://lore.kernel.org/lkml/ZGuSimj1cuQl3W5L@bhelgaas/T/#mba3fa2572dde0deddb40b5b24a31f4df41004bdf changes since v4: 1. addressing commens from Bjorn Helgaas: - reorder the includes by alpha - change all macros with upper-case hex - change ras_des type into u16 - remove unnecessary outer "()" - minor format changes 2. Address commensts from Jonathan Cameron: - rewrite doc and add a example to show how to use lane event 3. fix compile error reported by: kernel test robot - remove COMPILE_TEST and add depend on PCI in kconfig - add Reported-by: kernel test robot <lkp@intel.com> Changes since v3: 1. addressing comments from Robin Murphy: - add a prepare patch to define pci id in linux/pci_ids.h - remove unnecessary 64BIT dependency - fix DWC_PCIE_PER_EVENT_OFF/ON macro - remove dwc_pcie_pmu struct and move all its fileds into dwc_pcie_rp_info - remove unnecessary format field show - use sysfs_emit() instead of all the assorted sprintf() and snprintf() calls. - remove unnecessary spaces and remove unnecessary cast to follow event show convention - remove pcie_pmu_event_attr_is_visible - fix a refcout leak on error branch when walk pci device in for_each_pci_dev - remove bdf field from dwc_pcie_rp_info and calculate it at runtime - finish all the checks before allocating rp_info to avoid hanging wasted memory - remove some unused fields - warp out control register configuration from sub function to .add() - make function return type with a proper signature - fix lane event count enable by clear DWC_PCIE_CNT_ENABLE field first - pass rp_info directly to the read_*_counter helpers and in start, stop and add callbacks - move event type validtion into .event_init() - use is_sampling_event() to be consistent with everything else of pmu drivers - remove unnecessary dev_err message in .event_init() - return EINVAL instead EOPNOTSUPP for not a valid event - finish all the checks before start modifying the event - fix sibling event check by comparing event->pmu with sibling->pmu - probe PMU for each rootport independently - use .update() as .read() directly - remove dynamically generating symbolic name of lane event - redefine static symbolic name of lane event and leave lane filed to user - add CPU hotplug support 2. addressing comments from Baolin: - add a mask to avoid possible overflow Changes since v2 addressing comments from Baolin: - remove redundant macro definitions - use dev_err to print error message - change pmu_is_register to boolean - use PLATFORM_DEVID_NONE macro - fix module author format Changes since v1: 1. address comments from Jonathan: - drop marco for PMU name and VSEC version - simplify code with PCI standard marco - simplify code with FIELD_PREP()/FIELD_GET() to replace shift marco - name register filed with single _ instead double - wrap dwc_pcie_pmu_{write}_dword out and drop meaningless snaity check - check vendor id while matching vesc with pci_find_vsec_capability() - remove RP_NUM_MAX and use a list to organize PMU devices for rootports - replace DWC_PCIE_CREATE_BDF with standard PCI_DEVID - comments on riping register together 2. address comments from Bjorn: - rename DWC_PCIE_VSEC_ID to DWC_PCIE_VSEC_RAS_DES_ID - rename cap_pos to ras_des - simplify declare of device_attribute with DEVICE_ATTR_RO - simplify code with PCI standard macro and API like pcie_get_width_cap() - fix some code style problem and typo - drop meaningless snaity check of container_of 3. address comments from Yicong: - use sysfs_emit() to replace sprintf() - simplify iteration of pci device with for_each_pci_dev - pick preferred CPUs on a near die and add comments - unregister PMU drivers only for failed ones - log on behalf PMU device and give more hint - fix some code style problem (Thanks for all comments and they are very valuable to me) Shuai Xue (5): docs: perf: Add description for Synopsys DesignWare PCIe PMU driver PCI: Add Alibaba Vendor ID to linux/pci_ids.h PCI: move pci_clear_and_set_dword helper to pci header drivers/perf: add DesignWare PCIe PMU driver MAINTAINERS: add maintainers for DesignWare PCIe PMU driver .../admin-guide/perf/dwc_pcie_pmu.rst | 94 +++ Documentation/admin-guide/perf/index.rst | 1 + MAINTAINERS | 7 + drivers/infiniband/hw/erdma/erdma_hw.h | 2 - drivers/pci/access.c | 12 + drivers/pci/pcie/aspm.c | 11 - drivers/perf/Kconfig | 7 + drivers/perf/Makefile | 1 + drivers/perf/dwc_pcie_pmu.c | 798 ++++++++++++++++++ include/linux/pci.h | 2 + include/linux/pci_ids.h | 2 + 11 files changed, 924 insertions(+), 13 deletions(-) create mode 100644 Documentation/admin-guide/perf/dwc_pcie_pmu.rst create mode 100644 drivers/perf/dwc_pcie_pmu.c
Comments
Hi Shuai, On Sat, 4 Nov 2023, Shuai Xue wrote: > Change Log > ========== > > - move the &plat_dev->dev to previous line to warp more beautiful (Per Jonathan) > - rename error label with the same suffix 'err' (Per Jonathan) > - drop unnecessary else branch and return directly (Per Baolin) > - warp out set prev_count from dwc_pcie_pmu_set_period (Per Baolin) > - use PMU_FORMAT_ATTR to simplify format sysfs stuff (Per Will) > - export pci_clear_and_set_dword() to simplify _enable() functions (Per Will) > - simplify _read() function by unconditionally calculate with unit in bytes plused if branch for group#1 event (Per Will and Robin) > - simplify _update() function by unconditionally mask with 64-bit width plused if branch for lane event (Per Will) > - add type sanity check in _init() (Per Will) > - test with fuzzing tool before this new version (Per Will) > - register a platform device for each PCI device to probe RAS_DES PMU cap (Per Robin) > - add dwc_pcie_vendor_ids to extend vendor id for future added device (Per Krishna) > - pickup review-by tag from Baolin, Yicong and Jonathan ... > Shuai Xue (5): > docs: perf: Add description for Synopsys DesignWare PCIe PMU driver > PCI: Add Alibaba Vendor ID to linux/pci_ids.h > PCI: move pci_clear_and_set_dword helper to pci header > drivers/perf: add DesignWare PCIe PMU driver > MAINTAINERS: add maintainers for DesignWare PCIe PMU driver As I mentioned earlier, I successfully tested your patchset with a few patches on top of it to enable DWC PCIe PMU on AmpereOne. Thus, feel free to add this tag to all the patches above: Tested-by: Ilkka Koskinen <ilkka@os.amperecomputing.com> Br, Ilkka > > .../admin-guide/perf/dwc_pcie_pmu.rst | 94 +++ > Documentation/admin-guide/perf/index.rst | 1 + > MAINTAINERS | 7 + > drivers/infiniband/hw/erdma/erdma_hw.h | 2 - > drivers/pci/access.c | 12 + > drivers/pci/pcie/aspm.c | 11 - > drivers/perf/Kconfig | 7 + > drivers/perf/Makefile | 1 + > drivers/perf/dwc_pcie_pmu.c | 798 ++++++++++++++++++ > include/linux/pci.h | 2 + > include/linux/pci_ids.h | 2 + > 11 files changed, 924 insertions(+), 13 deletions(-) > create mode 100644 Documentation/admin-guide/perf/dwc_pcie_pmu.rst > create mode 100644 drivers/perf/dwc_pcie_pmu.c > > -- > 2.39.3 > > > _______________________________________________ > linux-arm-kernel mailing list > linux-arm-kernel@lists.infradead.org > http://lists.infradead.org/mailman/listinfo/linux-arm-kernel >
On 2023/11/16 08:57, Ilkka Koskinen wrote: > > Hi Shuai, > > On Sat, 4 Nov 2023, Shuai Xue wrote: >> Change Log >> ========== >> >> - move the &plat_dev->dev to previous line to warp more beautiful (Per Jonathan) >> - rename error label with the same suffix 'err' (Per Jonathan) >> - drop unnecessary else branch and return directly (Per Baolin) >> - warp out set prev_count from dwc_pcie_pmu_set_period (Per Baolin) >> - use PMU_FORMAT_ATTR to simplify format sysfs stuff (Per Will) >> - export pci_clear_and_set_dword() to simplify _enable() functions (Per Will) >> - simplify _read() function by unconditionally calculate with unit in bytes plused if branch for group#1 event (Per Will and Robin) >> - simplify _update() function by unconditionally mask with 64-bit width plused if branch for lane event (Per Will) >> - add type sanity check in _init() (Per Will) >> - test with fuzzing tool before this new version (Per Will) >> - register a platform device for each PCI device to probe RAS_DES PMU cap (Per Robin) >> - add dwc_pcie_vendor_ids to extend vendor id for future added device (Per Krishna) >> - pickup review-by tag from Baolin, Yicong and Jonathan > > ... > >> Shuai Xue (5): >> docs: perf: Add description for Synopsys DesignWare PCIe PMU driver >> PCI: Add Alibaba Vendor ID to linux/pci_ids.h >> PCI: move pci_clear_and_set_dword helper to pci header >> drivers/perf: add DesignWare PCIe PMU driver >> MAINTAINERS: add maintainers for DesignWare PCIe PMU driver > > As I mentioned earlier, I successfully tested your patchset with a few patches on top of it to enable DWC PCIe PMU on AmpereOne. Thus, feel free to add this tag to all the patches above: > > Tested-by: Ilkka Koskinen <ilkka@os.amperecomputing.com> > > Br, Ilkka Hi, Ilkka, Thank you. I will add your tested-by tag in the upcoming version v11. However, I kindly request some time to wait for feedback from esteemed maintainers such as Will, and Robin. Cheers, Shuai
On Thu, 16 Nov 2023, Shuai Xue wrote: > On 2023/11/16 08:57, Ilkka Koskinen wrote: >> >> Hi Shuai, >> >> On Sat, 4 Nov 2023, Shuai Xue wrote: >>> Change Log >>> ========== >>> >>> - move the &plat_dev->dev to previous line to warp more beautiful (Per Jonathan) >>> - rename error label with the same suffix 'err' (Per Jonathan) >>> - drop unnecessary else branch and return directly (Per Baolin) >>> - warp out set prev_count from dwc_pcie_pmu_set_period (Per Baolin) >>> - use PMU_FORMAT_ATTR to simplify format sysfs stuff (Per Will) >>> - export pci_clear_and_set_dword() to simplify _enable() functions (Per Will) >>> - simplify _read() function by unconditionally calculate with unit in bytes plused if branch for group#1 event (Per Will and Robin) >>> - simplify _update() function by unconditionally mask with 64-bit width plused if branch for lane event (Per Will) >>> - add type sanity check in _init() (Per Will) >>> - test with fuzzing tool before this new version (Per Will) >>> - register a platform device for each PCI device to probe RAS_DES PMU cap (Per Robin) >>> - add dwc_pcie_vendor_ids to extend vendor id for future added device (Per Krishna) >>> - pickup review-by tag from Baolin, Yicong and Jonathan >> >> ... >> >>> Shuai Xue (5): >>> docs: perf: Add description for Synopsys DesignWare PCIe PMU driver >>> PCI: Add Alibaba Vendor ID to linux/pci_ids.h >>> PCI: move pci_clear_and_set_dword helper to pci header >>> drivers/perf: add DesignWare PCIe PMU driver >>> MAINTAINERS: add maintainers for DesignWare PCIe PMU driver >> >> As I mentioned earlier, I successfully tested your patchset with a few patches on top of it to enable DWC PCIe PMU on AmpereOne. Thus, feel free to add this tag to all the patches above: >> >> Tested-by: Ilkka Koskinen <ilkka@os.amperecomputing.com> >> >> Br, Ilkka > > Hi, Ilkka, > > Thank you. > > I will add your tested-by tag in the upcoming version v11. However, I > kindly request some time to wait for feedback from esteemed maintainers such > as Will, and Robin. Sounds good. Actually, I can give you a little feedback on the driver patch meanwhile. --Ilkka > > Cheers, > Shuai >