[v4,2/2] arm64: kdump: Support crashkernel=X fall back to reserve region above DMA zones
Message ID | 20221116121044.1690-3-thunder.leizhen@huawei.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:6687:0:0:0:0:0 with SMTP id l7csp108439wru; Wed, 16 Nov 2022 04:21:44 -0800 (PST) X-Google-Smtp-Source: AA0mqf7V5JFjVZgQBJWFZK9vs324j9mZLKF5GMEaAFRr5snBf5ZykfuifiKTagoTbVG9vgHEzTRd X-Received: by 2002:aa7:ca50:0:b0:461:9d22:348d with SMTP id j16-20020aa7ca50000000b004619d22348dmr18206141edt.76.1668601304575; Wed, 16 Nov 2022 04:21:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1668601304; cv=none; d=google.com; s=arc-20160816; b=QGYMNvBaw/v8rq4xWKUvy/4Z5htpMK21wY5X98UBOqiWw4mJ6CRRhnJ6WH4svNtfJ7 t80nQAJ3dfWXkSoFeu6rrNxPOvWGexTe7Vub9iLbE3ukAjvLL1Kc11LX+TIGVsFCGAio FEO3F6aAn4rTElBY17wAassi0wHRz/0bKe7/gkxekV8FfqEFXYoc92CEdgD0ufcaNnqY i9rcpH32BJHHJnwc1IHjdw4sH4uFGCVfLIGHWhfA+cNhkg36HMSClv5F/jlkTrRzk3ff IwW29RieHwGlsq/lCRMxcXvHKfu7QPACzDyoy6WcX7lIacLg3eDPD95PdKbqIa/EkHma wADA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=04c7m+6BKiPdxIHCLywvReCNvsPX3tMl8QFiMeghMhY=; b=fPYhXWC7weOw5RuYOd569abs5d8Nvje9qa/x1OQZmZB6ffAwgNUh3nzb/TYxY7OO9N 52I00R/3/fFF7GHjqWsO7E150TVxWmPDdhgdjKWBZCG8+EulFRruKZLJ6qz8JReyA8RY WIYHwXLGsdmfP9vS+LEWoIPMx7qtihuMokZ26isxmu1xQKfYDMYad3pa5pOu624iz4LY vFRawYwAWtdio0N5oyaxvgwLZiCo8E4hM5zBE2wDDRhQ0+zUzCIHIIgs5mH4O9Rz5rTW CGpsV4TZk703XuULnur2rYRMXRj0COpBWNASuivQDzE6HAgqer8MkKOTyqSoTZqhiAJH SpVg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id ji3-20020a170907980300b007ae7d12bef1si14770882ejc.909.2022.11.16.04.21.18; Wed, 16 Nov 2022 04:21:44 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233299AbiKPMRp (ORCPT <rfc822;just.gull.subs@gmail.com> + 99 others); Wed, 16 Nov 2022 07:17:45 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60992 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233201AbiKPMQa (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Wed, 16 Nov 2022 07:16:30 -0500 Received: from szxga08-in.huawei.com (szxga08-in.huawei.com [45.249.212.255]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id BD898192A0; Wed, 16 Nov 2022 04:10:59 -0800 (PST) Received: from dggpemm500023.china.huawei.com (unknown [172.30.72.55]) by szxga08-in.huawei.com (SkyGuard) with ESMTP id 4NC21m3JKWz15Mh0; Wed, 16 Nov 2022 20:10:36 +0800 (CST) Received: from dggpemm500006.china.huawei.com (7.185.36.236) by dggpemm500023.china.huawei.com (7.185.36.83) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Wed, 16 Nov 2022 20:10:58 +0800 Received: from thunder-town.china.huawei.com (10.174.178.55) by dggpemm500006.china.huawei.com (7.185.36.236) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.31; Wed, 16 Nov 2022 20:10:57 +0800 From: Zhen Lei <thunder.leizhen@huawei.com> To: Dave Young <dyoung@redhat.com>, Baoquan He <bhe@redhat.com>, Vivek Goyal <vgoyal@redhat.com>, <kexec@lists.infradead.org>, <linux-kernel@vger.kernel.org>, Catalin Marinas <catalin.marinas@arm.com>, Will Deacon <will@kernel.org>, <linux-arm-kernel@lists.infradead.org>, Jonathan Corbet <corbet@lwn.net>, <linux-doc@vger.kernel.org> CC: Zhen Lei <thunder.leizhen@huawei.com>, "Eric W . Biederman" <ebiederm@xmission.com>, Randy Dunlap <rdunlap@infradead.org>, Feng Zhou <zhoufeng.zf@bytedance.com>, Kefeng Wang <wangkefeng.wang@huawei.com>, "Chen Zhou" <dingguo.cz@antgroup.com>, John Donnelly <John.p.donnelly@oracle.com>, Dave Kleikamp <dave.kleikamp@oracle.com> Subject: [PATCH v4 2/2] arm64: kdump: Support crashkernel=X fall back to reserve region above DMA zones Date: Wed, 16 Nov 2022 20:10:44 +0800 Message-ID: <20221116121044.1690-3-thunder.leizhen@huawei.com> X-Mailer: git-send-email 2.37.3.windows.1 In-Reply-To: <20221116121044.1690-1-thunder.leizhen@huawei.com> References: <20221116121044.1690-1-thunder.leizhen@huawei.com> MIME-Version: 1.0 Content-Transfer-Encoding: 7BIT Content-Type: text/plain; charset=US-ASCII X-Originating-IP: [10.174.178.55] X-ClientProxiedBy: dggems704-chm.china.huawei.com (10.3.19.181) To dggpemm500006.china.huawei.com (7.185.36.236) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1749655281147066385?= X-GMAIL-MSGID: =?utf-8?q?1749655281147066385?= |
Series |
arm64: kdump: Function supplement and performance optimization
|
|
Commit Message
Zhen Lei
Nov. 16, 2022, 12:10 p.m. UTC
For crashkernel=X without '@offset', select a region within DMA zones first, and fall back to reserve region above DMA zones. This allows users to use the same configuration on multiple platforms. Signed-off-by: Zhen Lei <thunder.leizhen@huawei.com> Acked-by: Baoquan He <bhe@redhat.com> --- Documentation/admin-guide/kernel-parameters.txt | 2 +- arch/arm64/mm/init.c | 17 ++++++++++++++++- 2 files changed, 17 insertions(+), 2 deletions(-)
Comments
On Wed, Nov 16, 2022 at 08:10:44PM +0800, Zhen Lei wrote: > For crashkernel=X without '@offset', select a region within DMA zones > first, and fall back to reserve region above DMA zones. This allows > users to use the same configuration on multiple platforms. > > Signed-off-by: Zhen Lei <thunder.leizhen@huawei.com> > Acked-by: Baoquan He <bhe@redhat.com> Reviewed-by: Catalin Marinas <catalin.marinas@arm.com>
On 11/16/22 at 08:10pm, Zhen Lei wrote: > For crashkernel=X without '@offset', select a region within DMA zones > first, and fall back to reserve region above DMA zones. This allows > users to use the same configuration on multiple platforms. > > Signed-off-by: Zhen Lei <thunder.leizhen@huawei.com> > Acked-by: Baoquan He <bhe@redhat.com> > --- > Documentation/admin-guide/kernel-parameters.txt | 2 +- > arch/arm64/mm/init.c | 17 ++++++++++++++++- > 2 files changed, 17 insertions(+), 2 deletions(-) > > diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt > index a7b7147447b8bf8..ef6d922ed26b9dc 100644 > --- a/Documentation/admin-guide/kernel-parameters.txt > +++ b/Documentation/admin-guide/kernel-parameters.txt > @@ -842,7 +842,7 @@ > memory region [offset, offset + size] for that kernel > image. If '@offset' is omitted, then a suitable offset > is selected automatically. > - [KNL, X86-64] Select a region under 4G first, and > + [KNL, X86-64, ARM64] Select a region under 4G first, and > fall back to reserve region above 4G when '@offset' > hasn't been specified. > See Documentation/admin-guide/kdump/kdump.rst for further details. > diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c > index ba7227179822d10..58a0bb2c17f18cf 100644 > --- a/arch/arm64/mm/init.c > +++ b/arch/arm64/mm/init.c > @@ -132,6 +132,7 @@ static void __init reserve_crashkernel(void) > unsigned long long crash_max = CRASH_ADDR_LOW_MAX; > char *cmdline = boot_command_line; > int ret; > + bool fixed_base = false; > > if (!IS_ENABLED(CONFIG_KEXEC_CORE)) > return; > @@ -163,12 +164,26 @@ static void __init reserve_crashkernel(void) > crash_size = PAGE_ALIGN(crash_size); > > /* User specifies base address explicitly. */ > - if (crash_base) > + if (crash_base) { > + fixed_base = true; > crash_max = crash_base + crash_size; > + } > > +retry: > crash_base = memblock_phys_alloc_range(crash_size, CRASH_ALIGN, > crash_base, crash_max); This pachset looks good to me. While we observed a unexpected case, where a reserved region cross the high and low meomry region. I noticed Catalin has pointed that out. Even with the suggested code, we could have the kernel parameters like: crashkernel=512M,high crashkernel=128M,low, we finally have [4G-126M, 4G+386M], [1G, 1G+128M] regions in running kernel. This looks strange. I am wondering if we can specify explicit search_base in memblock_phys_alloc_range() to avoid above case. Like this, crashkernel,high region will only exist in high memory, crashkernel,low region only exists in low memory region. I made a draft patch based on this patchset to present what the code looks like. diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c index 58a0bb2c17f1..fd9d35e17a62 100644 --- a/arch/arm64/mm/init.c +++ b/arch/arm64/mm/init.c @@ -127,7 +127,7 @@ static int __init reserve_crashkernel_low(unsigned long long low_size) */ static void __init reserve_crashkernel(void) { - unsigned long long crash_base, crash_size; + unsigned long long crash_base, crash_size, search_base; unsigned long long crash_low_size = 0; unsigned long long crash_max = CRASH_ADDR_LOW_MAX; char *cmdline = boot_command_line; @@ -155,6 +155,7 @@ static void __init reserve_crashkernel(void) else if (ret) return; + search_base = CRASH_ADDR_LOW_MAX; crash_max = CRASH_ADDR_HIGH_MAX; } else if (ret || !crash_size) { /* The specified value is invalid */ @@ -166,12 +167,13 @@ static void __init reserve_crashkernel(void) /* User specifies base address explicitly. */ if (crash_base) { fixed_base = true; + search_base = crash_base; crash_max = crash_base + crash_size; } retry: crash_base = memblock_phys_alloc_range(crash_size, CRASH_ALIGN, - crash_base, crash_max); + search_base, crash_max); if (!crash_base) { /* * If the first attempt was for low memory, fall back to @@ -180,6 +182,7 @@ static void __init reserve_crashkernel(void) */ if (!fixed_base && (crash_max == CRASH_ADDR_LOW_MAX)) { crash_max = CRASH_ADDR_HIGH_MAX; + search_base = CRASH_ADDR_LOW_MAX; crash_low_size = DEFAULT_CRASH_KERNEL_LOW_SIZE; goto retry; } @@ -189,7 +192,7 @@ static void __init reserve_crashkernel(void) return; } - if ((crash_base > CRASH_ADDR_LOW_MAX - crash_low_size) && + if ((crash_base > CRASH_ADDR_LOW_MAX) && crash_low_size && reserve_crashkernel_low(crash_low_size)) { memblock_phys_free(crash_base, crash_size); return;
On 2022/11/21 16:31, Baoquan He wrote: > On 11/16/22 at 08:10pm, Zhen Lei wrote: >> For crashkernel=X without '@offset', select a region within DMA zones >> first, and fall back to reserve region above DMA zones. This allows >> users to use the same configuration on multiple platforms. >> >> Signed-off-by: Zhen Lei <thunder.leizhen@huawei.com> >> Acked-by: Baoquan He <bhe@redhat.com> >> --- >> Documentation/admin-guide/kernel-parameters.txt | 2 +- >> arch/arm64/mm/init.c | 17 ++++++++++++++++- >> 2 files changed, 17 insertions(+), 2 deletions(-) >> >> diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt >> index a7b7147447b8bf8..ef6d922ed26b9dc 100644 >> --- a/Documentation/admin-guide/kernel-parameters.txt >> +++ b/Documentation/admin-guide/kernel-parameters.txt >> @@ -842,7 +842,7 @@ >> memory region [offset, offset + size] for that kernel >> image. If '@offset' is omitted, then a suitable offset >> is selected automatically. >> - [KNL, X86-64] Select a region under 4G first, and >> + [KNL, X86-64, ARM64] Select a region under 4G first, and >> fall back to reserve region above 4G when '@offset' >> hasn't been specified. >> See Documentation/admin-guide/kdump/kdump.rst for further details. >> diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c >> index ba7227179822d10..58a0bb2c17f18cf 100644 >> --- a/arch/arm64/mm/init.c >> +++ b/arch/arm64/mm/init.c >> @@ -132,6 +132,7 @@ static void __init reserve_crashkernel(void) >> unsigned long long crash_max = CRASH_ADDR_LOW_MAX; >> char *cmdline = boot_command_line; >> int ret; >> + bool fixed_base = false; >> >> if (!IS_ENABLED(CONFIG_KEXEC_CORE)) >> return; >> @@ -163,12 +164,26 @@ static void __init reserve_crashkernel(void) >> crash_size = PAGE_ALIGN(crash_size); >> >> /* User specifies base address explicitly. */ >> - if (crash_base) >> + if (crash_base) { >> + fixed_base = true; >> crash_max = crash_base + crash_size; >> + } >> >> +retry: >> crash_base = memblock_phys_alloc_range(crash_size, CRASH_ALIGN, >> crash_base, crash_max); > > This pachset looks good to me. While we observed a unexpected case, > where a reserved region cross the high and low meomry region. I noticed > Catalin has pointed that out. Even with the suggested code, we could > have the kernel parameters like: crashkernel=512M,high > crashkernel=128M,low, we finally have [4G-126M, 4G+386M], [1G, 1G+128M] > regions in running kernel. This looks strange. Is high-end memory fragmented? Add boot option memblock=debug and watch? > > I am wondering if we can specify explicit search_base in > memblock_phys_alloc_range() to avoid above case. Like this, > crashkernel,high region will only exist in high memory, crashkernel,low > region only exists in low memory region. I made a draft patch based on > this patchset to present what the code looks like. Looks good to me. > > diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c > index 58a0bb2c17f1..fd9d35e17a62 100644 > --- a/arch/arm64/mm/init.c > +++ b/arch/arm64/mm/init.c > @@ -127,7 +127,7 @@ static int __init reserve_crashkernel_low(unsigned long long low_size) > */ > static void __init reserve_crashkernel(void) > { > - unsigned long long crash_base, crash_size; > + unsigned long long crash_base, crash_size, search_base; search_base needs to be initialized to 0. > unsigned long long crash_low_size = 0; > unsigned long long crash_max = CRASH_ADDR_LOW_MAX; > char *cmdline = boot_command_line; > @@ -155,6 +155,7 @@ static void __init reserve_crashkernel(void) > else if (ret) > return; > > + search_base = CRASH_ADDR_LOW_MAX; > crash_max = CRASH_ADDR_HIGH_MAX; > } else if (ret || !crash_size) { > /* The specified value is invalid */ > @@ -166,12 +167,13 @@ static void __init reserve_crashkernel(void) > /* User specifies base address explicitly. */ > if (crash_base) { > fixed_base = true; > + search_base = crash_base; > crash_max = crash_base + crash_size; > } > > retry: > crash_base = memblock_phys_alloc_range(crash_size, CRASH_ALIGN, > - crash_base, crash_max); > + search_base, crash_max); > if (!crash_base) { > /* > * If the first attempt was for low memory, fall back to > @@ -180,6 +182,7 @@ static void __init reserve_crashkernel(void) > */ > if (!fixed_base && (crash_max == CRASH_ADDR_LOW_MAX)) { > crash_max = CRASH_ADDR_HIGH_MAX; > + search_base = CRASH_ADDR_LOW_MAX; > crash_low_size = DEFAULT_CRASH_KERNEL_LOW_SIZE; > goto retry; > } > @@ -189,7 +192,7 @@ static void __init reserve_crashkernel(void) > return; > } > > - if ((crash_base > CRASH_ADDR_LOW_MAX - crash_low_size) && > + if ((crash_base > CRASH_ADDR_LOW_MAX) && > crash_low_size && reserve_crashkernel_low(crash_low_size)) { > memblock_phys_free(crash_base, crash_size); > return; > > . >
diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt index a7b7147447b8bf8..ef6d922ed26b9dc 100644 --- a/Documentation/admin-guide/kernel-parameters.txt +++ b/Documentation/admin-guide/kernel-parameters.txt @@ -842,7 +842,7 @@ memory region [offset, offset + size] for that kernel image. If '@offset' is omitted, then a suitable offset is selected automatically. - [KNL, X86-64] Select a region under 4G first, and + [KNL, X86-64, ARM64] Select a region under 4G first, and fall back to reserve region above 4G when '@offset' hasn't been specified. See Documentation/admin-guide/kdump/kdump.rst for further details. diff --git a/arch/arm64/mm/init.c b/arch/arm64/mm/init.c index ba7227179822d10..58a0bb2c17f18cf 100644 --- a/arch/arm64/mm/init.c +++ b/arch/arm64/mm/init.c @@ -132,6 +132,7 @@ static void __init reserve_crashkernel(void) unsigned long long crash_max = CRASH_ADDR_LOW_MAX; char *cmdline = boot_command_line; int ret; + bool fixed_base = false; if (!IS_ENABLED(CONFIG_KEXEC_CORE)) return; @@ -163,12 +164,26 @@ static void __init reserve_crashkernel(void) crash_size = PAGE_ALIGN(crash_size); /* User specifies base address explicitly. */ - if (crash_base) + if (crash_base) { + fixed_base = true; crash_max = crash_base + crash_size; + } +retry: crash_base = memblock_phys_alloc_range(crash_size, CRASH_ALIGN, crash_base, crash_max); if (!crash_base) { + /* + * If the first attempt was for low memory, fall back to + * high memory, the minimum required low memory will be + * reserved later. + */ + if (!fixed_base && (crash_max == CRASH_ADDR_LOW_MAX)) { + crash_max = CRASH_ADDR_HIGH_MAX; + crash_low_size = DEFAULT_CRASH_KERNEL_LOW_SIZE; + goto retry; + } + pr_warn("cannot allocate crashkernel (size:0x%llx)\n", crash_size); return;