Message ID | 1676978713-7394-1-git-send-email-quic_mojha@quicinc.com |
---|---|
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:adf:eb09:0:0:0:0:0 with SMTP id s9csp1811035wrn; Tue, 21 Feb 2023 03:33:12 -0800 (PST) X-Google-Smtp-Source: AK7set8PDnWADZL1iEJjAsdB2iwu66a19yvdTjqzH794FkhKcwIdpGJcSrmwoC7C6wxMf5z6vxjz X-Received: by 2002:aa7:98c3:0:b0:5a8:4ba7:5840 with SMTP id e3-20020aa798c3000000b005a84ba75840mr3336978pfm.26.1676979192336; Tue, 21 Feb 2023 03:33:12 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1676979192; cv=none; d=google.com; s=arc-20160816; b=DZ2xlWP5s4bV9fbrFYzZRvRyHQw1Nxy3Y007BDVY9lxFZkHdghfmKNmfd2nZMXazQ3 UVmnX6p9nhkuyLmtbbBsryeDhGyoIqEssAL6gtCZthDLe8qSUl/dR/iBDH7AXOsj8IsS JhLTZEO1bs7Ess7RpDmlw3dKEKg36QwZyNVKRqvbL9NHOktnzqDwuDKRii/iCISmNnU9 IzGmWukuSTMRnA3Obtt0KmPiEOvGePXWZDdiTqxQDK4oqVn6FSBYkoAGgjyAWfHTlFJO xTQAtb4k3WpSsI87qnGxF6pXpqdPWZAOkKGywiYqnnhlV0e0caPk0cqkBkO2EMkzG5yO 2Vsw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:message-id:date:subject:cc:to:from :dkim-signature; bh=/Wy6qoadl6SGbuKajU9rwIy3JAd97ZcZ5RR1eQKvzFI=; b=ROdV07T3dMiysr5anRByFuPxbcf+6kBieBGhgY5hiSDNRcmI98UIxKPP7vWCaHvZGB j06MEdT5StZjjXzuQr3PKeCpDP904oL4Tvbn5J+2QMhnzlEmxLsl0kmhJbR3vBTi5Y4s eWAVPHFzD7NFH3qAejVYoQ5BISXj6A07PyWf7TC5iwiAzqa6cFN9BLTXUmdmgWGvrbLa vQai5b/fGg+bv0XRtT7wjxlJyiicZ5hzQswU98fHXDoWDupJqqs+hMuTPYISexWI8Td7 6ka6iagXxBdctG1hfQFLgvl5SQQPlWGb5i8iCH050h72Mbe5Qr5jJoZNDDH6brbuCMhI UMaA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcppdkim1 header.b="lvu4b4E/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id t11-20020aa78f8b000000b005a8bc137b77si912584pfs.134.2023.02.21.03.32.59; Tue, 21 Feb 2023 03:33:12 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@quicinc.com header.s=qcppdkim1 header.b="lvu4b4E/"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=quicinc.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234090AbjBUL0d (ORCPT <rfc822;kautuk.consul.80@gmail.com> + 99 others); Tue, 21 Feb 2023 06:26:33 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50278 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233460AbjBUL0b (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Tue, 21 Feb 2023 06:26:31 -0500 Received: from mx0a-0031df01.pphosted.com (mx0a-0031df01.pphosted.com [205.220.168.131]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4FDA722DE6; Tue, 21 Feb 2023 03:26:30 -0800 (PST) Received: from pps.filterd (m0279862.ppops.net [127.0.0.1]) by mx0a-0031df01.pphosted.com (8.17.1.19/8.17.1.19) with ESMTP id 31LARU9F024421; Tue, 21 Feb 2023 11:26:02 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=quicinc.com; h=from : to : cc : subject : date : message-id : mime-version : content-type; s=qcppdkim1; bh=/Wy6qoadl6SGbuKajU9rwIy3JAd97ZcZ5RR1eQKvzFI=; b=lvu4b4E/PSu/U2lJP8hvZUnw1LnDYl0Yax7e5bTb7h+95Yt+8yCH3y41ce6SeB0Y5Aox N4AorHuujyYghHj2fZSafxqd9HmiQjs0l5F93xUqp11sASABC/iV/IxFbYh+XblH3X7R QNf56+yh9B36WDG09G6pPiCk+4559y+oVU/zFICmCRuyEiVBVSJqwKhI6YGx+qn6gk+e P/GECR6Hvvcx0FeQLMMUf2ibbw6Bb+h17Tpo9S/87nIAOWfVCz20HdK+Hi2q2AXAIyVW Fm6/qB4WsySQcd7AuZxxpafRNaISL3hw05nLOwg7EPVPeMF+djIdGZSWgbtqi3en+UCY LQ== Received: from nasanppmta05.qualcomm.com (i-global254.qualcomm.com [199.106.103.254]) by mx0a-0031df01.pphosted.com (PPS) with ESMTPS id 3nvp4v0y6d-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Tue, 21 Feb 2023 11:26:01 +0000 Received: from nasanex01c.na.qualcomm.com (nasanex01c.na.qualcomm.com [10.45.79.139]) by NASANPPMTA05.qualcomm.com (8.17.1.5/8.17.1.5) with ESMTPS id 31LBQ1dW005630 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Tue, 21 Feb 2023 11:26:01 GMT Received: from hu-mojha-hyd.qualcomm.com (10.80.80.8) by nasanex01c.na.qualcomm.com (10.45.79.139) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.986.41; Tue, 21 Feb 2023 03:25:56 -0800 From: Mukesh Ojha <quic_mojha@quicinc.com> To: <agross@kernel.org>, <andersson@kernel.org>, <konrad.dybcio@linaro.org>, <keescook@chromium.org>, <tony.luck@intel.com>, <gpiccoli@igalia.com>, <catalin.marinas@arm.com>, <will@kernel.org> CC: <linux-arm-msm@vger.kernel.org>, <linux-remoteproc@vger.kernel.org>, <linux-kernel@vger.kernel.org>, <linux-hardening@vger.kernel.org>, <linux-arm-kernel@lists.infradead.org>, Mukesh Ojha <quic_mojha@quicinc.com> Subject: [RFC PATCH 0/6] Add basic Minidump kernel driver support Date: Tue, 21 Feb 2023 16:55:07 +0530 Message-ID: <1676978713-7394-1-git-send-email-quic_mojha@quicinc.com> X-Mailer: git-send-email 2.7.4 MIME-Version: 1.0 Content-Type: text/plain X-Originating-IP: [10.80.80.8] X-ClientProxiedBy: nasanex01b.na.qualcomm.com (10.46.141.250) To nasanex01c.na.qualcomm.com (10.45.79.139) X-QCInternal: smtphost X-Proofpoint-Virus-Version: vendor=nai engine=6200 definitions=5800 signatures=585085 X-Proofpoint-ORIG-GUID: a5P6fu8JZBUxR4Ov9Ud2zNc6pYHWhwNy X-Proofpoint-GUID: a5P6fu8JZBUxR4Ov9Ud2zNc6pYHWhwNy X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.219,Aquarius:18.0.930,Hydra:6.0.562,FMLib:17.11.170.22 definitions=2023-02-21_06,2023-02-20_02,2023-02-09_01 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 mlxscore=0 mlxlogscore=999 clxscore=1015 adultscore=0 spamscore=0 phishscore=0 suspectscore=0 bulkscore=0 impostorscore=0 lowpriorityscore=0 malwarescore=0 priorityscore=1501 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2212070000 definitions=main-2302210098 X-Spam-Status: No, score=-2.8 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_LOW,SPF_HELO_NONE, SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1758440133143674981?= X-GMAIL-MSGID: =?utf-8?q?1758440133143674981?= |
Series |
Add basic Minidump kernel driver support
|
|
Message
Mukesh Ojha
Feb. 21, 2023, 11:25 a.m. UTC
Minidump is a best effort mechanism to collect useful and predefined data for first level of debugging on end user devices running on Qualcomm SoCs. It is built on the premise that System on Chip (SoC) or subsystem part of SoC crashes, due to a range of hardware and software bugs. Hence, the ability to collect accurate data is only a best-effort. The data collected could be invalid or corrupted, data collection itself could fail, and so on. Qualcomm devices in engineering mode provides a mechanism for generating full system ramdumps for post mortem debugging. But in some cases it's however not feasible to capture the entire content of RAM. The minidump mechanism provides the means for selecting which snippets should be included in the ramdump. The core of minidump feature is part of Qualcomm's boot firmware code. It initializes shared memory (SMEM), which is a part of DDR and allocates a small section of SMEM to minidump table i.e also called global table of content (G-ToC). Each subsystem (APSS, ADSP, ...) has their own table of segments to be included in the minidump and all get their reference from G-ToC. Each segment/region has some details like name, physical address and it's size etc. and it could be anywhere scattered in the DDR. Existing upstream Qualcomm remoteproc driver[1] already supports minidump feature for remoteproc instances like ADSP, MODEM, ... where predefined selective segments of subsystem region can be dumped as part of coredump collection which generates smaller size artifacts compared to complete coredump of subsystem on crash. [1] https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/drivers/remoteproc/qcom_common.c#n142 In addition to managing and querying the APSS minidump description, the Linux driver maintains a ELF header in a segment. This segment gets updated with section/program header whenever a new entry gets registered. Patch 1/6 is very trivial change. Patch 2/6 moves the minidump specific data structure and macro to qcom_minidump.h so that (3/6) minidump driver can use. Patch 3/6 implements qualcomm minidump kernel driver and exports symbol which other minidump kernel client can use. Patch 4/6 enables the qualcomm minidump driver. Patch 5/6 Use the exported symbol from minidump driver in qcom_common for querying minidump descriptor for a subsystem. Patch 6/6 Register pstore region with minidump. Testing of the patches has been done on sm8450 target with the help of out of tree patch which helps to set the download mode and storage type(on which dump will be saved) for which i will send separate series. Mukesh Ojha (6): remoteproc: qcom: Expand MD_* as MINIDUMP_* remoteproc: qcom: Move minidump specific data to qcom_minidump.h soc: qcom: Add Qualcomm minidump kernel driver arm64: defconfig: Enable Qualcomm minidump driver remoterproc: qcom: refactor to leverage exported minidump symbol pstore/ram: Register context with minidump arch/arm64/configs/defconfig | 1 + drivers/remoteproc/qcom_common.c | 75 +----- drivers/soc/qcom/Kconfig | 14 ++ drivers/soc/qcom/Makefile | 1 + drivers/soc/qcom/qcom_minidump.c | 490 +++++++++++++++++++++++++++++++++++++++++ fs/pstore/ram.c | 77 ++++++ include/soc/qcom/minidump.h | 40 ++++ include/soc/qcom/qcom_minidump.h | 88 +++++++ 8 files changed, 717 insertions(+), 69 deletions(-) create mode 100644 drivers/soc/qcom/qcom_minidump.c create mode 100644 include/soc/qcom/minidump.h create mode 100644 include/soc/qcom/qcom_minidump.h
Comments
On Tue, Feb 21, 2023 at 04:55:07PM +0530, Mukesh Ojha wrote: > Minidump is a best effort mechanism to collect useful and predefined data > for first level of debugging on end user devices running on Qualcomm SoCs. > It is built on the premise that System on Chip (SoC) or subsystem part of > SoC crashes, due to a range of hardware and software bugs. Hence, the > ability to collect accurate data is only a best-effort. The data collected > could be invalid or corrupted, data collection itself could fail, and so on. > > Qualcomm devices in engineering mode provides a mechanism for generating > full system ramdumps for post mortem debugging. But in some cases it's > however not feasible to capture the entire content of RAM. The minidump > mechanism provides the means for selecting which snippets should be > included in the ramdump. > > The core of minidump feature is part of Qualcomm's boot firmware code. > It initializes shared memory (SMEM), which is a part of DDR and > allocates a small section of SMEM to minidump table i.e also called > global table of content (G-ToC). Each subsystem (APSS, ADSP, ...) has > their own table of segments to be included in the minidump and all get > their reference from G-ToC. Each segment/region has some details like > name, physical address and it's size etc. and it could be anywhere > scattered in the DDR. > > Existing upstream Qualcomm remoteproc driver[1] already supports minidump > feature for remoteproc instances like ADSP, MODEM, ... where predefined > selective segments of subsystem region can be dumped as part of > coredump collection which generates smaller size artifacts compared to > complete coredump of subsystem on crash. > > [1] > https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/drivers/remoteproc/qcom_common.c#n142 > > In addition to managing and querying the APSS minidump description, > the Linux driver maintains a ELF header in a segment. This segment > gets updated with section/program header whenever a new entry gets > registered. I'd like to test this series plus your series that sets the multiple download modes. Can you include documentation about how to actually use this new feature? Also the information that you provided above is really useful. I think that should also go in the documentation file as well. I already have a reliable way to make a board go BOOM and go into ramdump mode. Brian
Thanks Brian for your interest in this series. On 2/23/2023 6:07 PM, Brian Masney wrote: > On Tue, Feb 21, 2023 at 04:55:07PM +0530, Mukesh Ojha wrote: >> Minidump is a best effort mechanism to collect useful and predefined data >> for first level of debugging on end user devices running on Qualcomm SoCs. >> It is built on the premise that System on Chip (SoC) or subsystem part of >> SoC crashes, due to a range of hardware and software bugs. Hence, the >> ability to collect accurate data is only a best-effort. The data collected >> could be invalid or corrupted, data collection itself could fail, and so on. >> >> Qualcomm devices in engineering mode provides a mechanism for generating >> full system ramdumps for post mortem debugging. But in some cases it's >> however not feasible to capture the entire content of RAM. The minidump >> mechanism provides the means for selecting which snippets should be >> included in the ramdump. >> >> The core of minidump feature is part of Qualcomm's boot firmware code. >> It initializes shared memory (SMEM), which is a part of DDR and >> allocates a small section of SMEM to minidump table i.e also called >> global table of content (G-ToC). Each subsystem (APSS, ADSP, ...) has >> their own table of segments to be included in the minidump and all get >> their reference from G-ToC. Each segment/region has some details like >> name, physical address and it's size etc. and it could be anywhere >> scattered in the DDR. >> >> Existing upstream Qualcomm remoteproc driver[1] already supports minidump >> feature for remoteproc instances like ADSP, MODEM, ... where predefined >> selective segments of subsystem region can be dumped as part of >> coredump collection which generates smaller size artifacts compared to >> complete coredump of subsystem on crash. >> >> [1] >> https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/drivers/remoteproc/qcom_common.c#n142 >> >> In addition to managing and querying the APSS minidump description, >> the Linux driver maintains a ELF header in a segment. This segment >> gets updated with section/program header whenever a new entry gets >> registered. > > I'd like to test this series plus your series that sets the multiple > download modes. Sure, you are welcome, but for that you need a device running with Qualcomm SoC and if it has a upstream support. Also, testing of this patch needs some minimal out of tree patches and i can help you with that. > Can you include documentation about how to actually use > this new feature? Will surely do, Since this is still RFC, and i am doubtful on the path of it in documentation directory. Also the information that you provided above is really > useful. I think that should also go in the documentation file as well. > > I already have a reliable way to make a board go BOOM and go into > ramdump mode. That's very nice to hear; but again if you can specify your target specification. -Mukesh > > Brian >
On 2/24/2023 2:40 AM, Mukesh Ojha wrote: > Thanks Brian for your interest in this series. > > On 2/23/2023 6:07 PM, Brian Masney wrote: >> On Tue, Feb 21, 2023 at 04:55:07PM +0530, Mukesh Ojha wrote: >>> Minidump is a best effort mechanism to collect useful and predefined >>> data >>> for first level of debugging on end user devices running on Qualcomm >>> SoCs. >>> It is built on the premise that System on Chip (SoC) or subsystem >>> part of >>> SoC crashes, due to a range of hardware and software bugs. Hence, the >>> ability to collect accurate data is only a best-effort. The data >>> collected >>> could be invalid or corrupted, data collection itself could fail, and >>> so on. >>> >>> Qualcomm devices in engineering mode provides a mechanism for generating >>> full system ramdumps for post mortem debugging. But in some cases it's >>> however not feasible to capture the entire content of RAM. The minidump >>> mechanism provides the means for selecting which snippets should be >>> included in the ramdump. >>> >>> The core of minidump feature is part of Qualcomm's boot firmware code. >>> It initializes shared memory (SMEM), which is a part of DDR and >>> allocates a small section of SMEM to minidump table i.e also called >>> global table of content (G-ToC). Each subsystem (APSS, ADSP, ...) has >>> their own table of segments to be included in the minidump and all get >>> their reference from G-ToC. Each segment/region has some details like >>> name, physical address and it's size etc. and it could be anywhere >>> scattered in the DDR. >>> >>> Existing upstream Qualcomm remoteproc driver[1] already supports >>> minidump >>> feature for remoteproc instances like ADSP, MODEM, ... where predefined >>> selective segments of subsystem region can be dumped as part of >>> coredump collection which generates smaller size artifacts compared to >>> complete coredump of subsystem on crash. >>> >>> [1] >>> https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/drivers/remoteproc/qcom_common.c#n142 >>> >>> In addition to managing and querying the APSS minidump description, >>> the Linux driver maintains a ELF header in a segment. This segment >>> gets updated with section/program header whenever a new entry gets >>> registered. >> >> I'd like to test this series plus your series that sets the multiple >> download modes. > > Sure, you are welcome, but for that you need a device running with > Qualcomm SoC and if it has a upstream support. > > Also, testing of this patch needs some minimal out of tree patches and > i can help you with that. > >> Can you include documentation about how to actually use >> this new feature? > > Will surely do, Since this is still RFC, and i am doubtful on the path > of it in documentation directory. This is RFC anyways, you can start w/ the directory which you think best fits here. The point here is to have the documentation file rather than path to be fixed. You can start w/ Documentation/features/debug and let's see what others have any suggestion. Please add a file in your next revision without worrying about the path for now. ---Trilok Soni
Hi Mukesh, On Fri, Feb 24, 2023 at 04:10:42PM +0530, Mukesh Ojha wrote: > On 2/23/2023 6:07 PM, Brian Masney wrote: > > I'd like to test this series plus your series that sets the multiple > > download modes. > > Sure, you are welcome, but for that you need a device running with Qualcomm > SoC and if it has a upstream support. I will be testing this series on a sa8540p (QDrive3 Automotive Development Board), which has the sc8280xp SoC with good upstream support. This is also the same board that I have a reliable way to make the board crash due to a known firmware bug. > Also, testing of this patch needs some minimal out of tree patches and > i can help you with that. Yup, that's fine. Hopefully we can also work to get those dependencies merged upstream as well. Brian
On 2/25/2023 12:36 AM, Brian Masney wrote: > Hi Mukesh, > > On Fri, Feb 24, 2023 at 04:10:42PM +0530, Mukesh Ojha wrote: >> On 2/23/2023 6:07 PM, Brian Masney wrote: >>> I'd like to test this series plus your series that sets the multiple >>> download modes. >> >> Sure, you are welcome, but for that you need a device running with Qualcomm >> SoC and if it has a upstream support. > > I will be testing this series on a sa8540p (QDrive3 Automotive > Development Board), which has the sc8280xp SoC with good upstream > support. This is also the same board that I have a reliable way to > make the board crash due to a known firmware bug. > Can you try below patch to just select minidump download mode and make the device crash ? --------------------------------------->8------------------------------- diff --git a/arch/arm64/boot/dts/qcom/sc8280xp.dtsi b/arch/arm64/boot/dts/qcom/sc8280xp.dtsi index 0d02599..bd8e1a8 100644 --- a/arch/arm64/boot/dts/qcom/sc8280xp.dtsi +++ b/arch/arm64/boot/dts/qcom/sc8280xp.dtsi @@ -280,6 +280,7 @@ firmware { scm: scm { compatible = "qcom,scm-sc8280xp", "qcom,scm"; + qcom,dload-mode = <&tcsr 0x13000>; }; }; diff --git a/drivers/firmware/qcom_scm.c b/drivers/firmware/qcom_scm.c index cdbfe54..e1539a2 100644 --- a/drivers/firmware/qcom_scm.c +++ b/drivers/firmware/qcom_scm.c @@ -20,7 +20,7 @@ #include "qcom_scm.h" -static bool download_mode = IS_ENABLED(CONFIG_QCOM_SCM_DOWNLOAD_MODE_DEFAULT); +static bool download_mode = true; module_param(download_mode, bool, 0); #define SCM_HAS_CORE_CLK BIT(0) @@ -427,7 +427,7 @@ static void qcom_scm_set_download_mode(bool enable) ret = __qcom_scm_set_dload_mode(__scm->dev, enable); } else if (__scm->dload_mode_addr) { ret = qcom_scm_io_writel(__scm->dload_mode_addr, - enable ? QCOM_SCM_BOOT_SET_DLOAD_MODE : 0); + enable ? 0x20 : 0); } else { dev_err(__scm->dev, "No available mechanism for setting download mode\n"); >> Also, testing of this patch needs some minimal out of tree patches and >> i can help you with that. > > Yup, that's fine. Hopefully we can also work to get those dependencies > merged upstream as well. > > Brian >
Friendly review reminder.. -Mukesh On 2/21/2023 4:55 PM, Mukesh Ojha wrote: > Minidump is a best effort mechanism to collect useful and predefined data > for first level of debugging on end user devices running on Qualcomm SoCs. > It is built on the premise that System on Chip (SoC) or subsystem part of > SoC crashes, due to a range of hardware and software bugs. Hence, the > ability to collect accurate data is only a best-effort. The data collected > could be invalid or corrupted, data collection itself could fail, and so on. > > Qualcomm devices in engineering mode provides a mechanism for generating > full system ramdumps for post mortem debugging. But in some cases it's > however not feasible to capture the entire content of RAM. The minidump > mechanism provides the means for selecting which snippets should be > included in the ramdump. > > The core of minidump feature is part of Qualcomm's boot firmware code. > It initializes shared memory (SMEM), which is a part of DDR and > allocates a small section of SMEM to minidump table i.e also called > global table of content (G-ToC). Each subsystem (APSS, ADSP, ...) has > their own table of segments to be included in the minidump and all get > their reference from G-ToC. Each segment/region has some details like > name, physical address and it's size etc. and it could be anywhere > scattered in the DDR. > > Existing upstream Qualcomm remoteproc driver[1] already supports minidump > feature for remoteproc instances like ADSP, MODEM, ... where predefined > selective segments of subsystem region can be dumped as part of > coredump collection which generates smaller size artifacts compared to > complete coredump of subsystem on crash. > > [1] > https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/drivers/remoteproc/qcom_common.c#n142 > > In addition to managing and querying the APSS minidump description, > the Linux driver maintains a ELF header in a segment. This segment > gets updated with section/program header whenever a new entry gets > registered. > > Patch 1/6 is very trivial change. > Patch 2/6 moves the minidump specific data structure and macro to > qcom_minidump.h so that (3/6) minidump driver can use. > Patch 3/6 implements qualcomm minidump kernel driver and exports > symbol which other minidump kernel client can use. > Patch 4/6 enables the qualcomm minidump driver. > Patch 5/6 Use the exported symbol from minidump driver in qcom_common > for querying minidump descriptor for a subsystem. > Patch 6/6 Register pstore region with minidump. > > Testing of the patches has been done on sm8450 target with the help > of out of tree patch which helps to set the download mode and storage > type(on which dump will be saved) for which i will send separate series. > > Mukesh Ojha (6): > remoteproc: qcom: Expand MD_* as MINIDUMP_* > remoteproc: qcom: Move minidump specific data to qcom_minidump.h > soc: qcom: Add Qualcomm minidump kernel driver > arm64: defconfig: Enable Qualcomm minidump driver > remoterproc: qcom: refactor to leverage exported minidump symbol > pstore/ram: Register context with minidump > > arch/arm64/configs/defconfig | 1 + > drivers/remoteproc/qcom_common.c | 75 +----- > drivers/soc/qcom/Kconfig | 14 ++ > drivers/soc/qcom/Makefile | 1 + > drivers/soc/qcom/qcom_minidump.c | 490 +++++++++++++++++++++++++++++++++++++++++ > fs/pstore/ram.c | 77 ++++++ > include/soc/qcom/minidump.h | 40 ++++ > include/soc/qcom/qcom_minidump.h | 88 +++++++ > 8 files changed, 717 insertions(+), 69 deletions(-) > create mode 100644 drivers/soc/qcom/qcom_minidump.c > create mode 100644 include/soc/qcom/minidump.h > create mode 100644 include/soc/qcom/qcom_minidump.h >
On Mon, Mar 06, 2023 at 08:58:04PM +0530, Mukesh Ojha wrote:
> Friendly review reminder..
It is a few hours after the merge window closed, please be patient.
And to help out, please review other submissions to reduce the review
load on maintainers. To not do that is just asking for others to do
work for you without any help, right?
thanks,
greg k-h
On Mon, Feb 27, 2023 at 03:45:31PM +0530, Mukesh Ojha wrote: > > > On 2/25/2023 12:36 AM, Brian Masney wrote: > > Hi Mukesh, > > > > On Fri, Feb 24, 2023 at 04:10:42PM +0530, Mukesh Ojha wrote: > > > On 2/23/2023 6:07 PM, Brian Masney wrote: > > > > I'd like to test this series plus your series that sets the multiple > > > > download modes. > > > > > > Sure, you are welcome, but for that you need a device running with Qualcomm > > > SoC and if it has a upstream support. > > > > I will be testing this series on a sa8540p (QDrive3 Automotive > > Development Board), which has the sc8280xp SoC with good upstream > > support. This is also the same board that I have a reliable way to > > make the board crash due to a known firmware bug. > > > > > Can you try below patch to just select minidump download mode and make the > device crash ? > > --------------------------------------->8------------------------------- > diff --git a/arch/arm64/boot/dts/qcom/sc8280xp.dtsi > b/arch/arm64/boot/dts/qcom/sc8280xp.dtsi > index 0d02599..bd8e1a8 100644 > --- a/arch/arm64/boot/dts/qcom/sc8280xp.dtsi > +++ b/arch/arm64/boot/dts/qcom/sc8280xp.dtsi > @@ -280,6 +280,7 @@ > firmware { > scm: scm { > compatible = "qcom,scm-sc8280xp", "qcom,scm"; > + qcom,dload-mode = <&tcsr 0x13000>; > }; > }; > > diff --git a/drivers/firmware/qcom_scm.c b/drivers/firmware/qcom_scm.c > index cdbfe54..e1539a2 100644 > --- a/drivers/firmware/qcom_scm.c > +++ b/drivers/firmware/qcom_scm.c > @@ -20,7 +20,7 @@ > > #include "qcom_scm.h" > > -static bool download_mode = > IS_ENABLED(CONFIG_QCOM_SCM_DOWNLOAD_MODE_DEFAULT); > +static bool download_mode = true; > module_param(download_mode, bool, 0); > > #define SCM_HAS_CORE_CLK BIT(0) > @@ -427,7 +427,7 @@ static void qcom_scm_set_download_mode(bool enable) > ret = __qcom_scm_set_dload_mode(__scm->dev, enable); > } else if (__scm->dload_mode_addr) { > ret = qcom_scm_io_writel(__scm->dload_mode_addr, > - enable ? QCOM_SCM_BOOT_SET_DLOAD_MODE : 0); > + enable ? 0x20 : 0); > } else { > dev_err(__scm->dev, > "No available mechanism for setting download > mode\n"); Hi Mukesh, I tried to test this series but I don't know how to actually use the minidump feature that's in this series. Some more documentation is needed. I added this series, plus your other series that adds the download modes to the SCM driver to my tree, along with your changes above. I downgraded the firmware on my sa8540p and I have my reproducible crash. Linux immediately loses control and the board firmware takes over. I assumed that I'd need to do a warm reboot so that DDR contents are still present so Linux can grab the memory contents on next reboot. However, 'fastboot devices' shows no devices so I can't reboot that way. I can do a cold boot but the DDR contents will be lost. Also this series needs to be rebased against 6.3rc1. Brian