Message ID | E1rVDmU-0027YP-Jz@rmk-PC.armlinux.org.uk |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel+bounces-46827-ouuuleilei=gmail.com@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a05:7301:2087:b0:106:209c:c626 with SMTP id gs7csp2018667dyb; Wed, 31 Jan 2024 08:50:52 -0800 (PST) X-Google-Smtp-Source: AGHT+IHjHG9SywYIiEESDYrBeua7NoOdd0gk8CE6Qd+dWfCTVjP5bdbi52jgOWPQ1+4oQpUjMm4d X-Received: by 2002:a17:906:c293:b0:a35:9513:4081 with SMTP id r19-20020a170906c29300b00a3595134081mr1214607ejz.14.1706719852469; Wed, 31 Jan 2024 08:50:52 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1706719852; cv=pass; d=google.com; s=arc-20160816; b=oJdCrrxAiE1wrfRPkqieM5gOJYblQ8Er+ZgWRXS8/DwN20RU5vPYnnL5+CTWMnT4M7 eS1GCzAiC/KR/eUXiZFaphZ8t8G0M1oDsdowxfEXp7bltSqEQkbi5qnOpXR7m20LHZ4y UWn723yND4IyEs15F3/d7UgU6A5XLEyKrIcTxI2AcWjYfVDmHR/QOLrkSoPYb79vfX9p /yMh3Fq8cwlegzFfF8ipd50jNRDTOn/KPJl6u2GLzsNQh7So4IXUUwyf7daecGDpoMQU VbpdZbImW6fz6RIX71E2sWkzF3iXfoD5UKTc4LdYuqhSb3aSv61OnPhKgPmohzQA/KIR udqA== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=date:sender:message-id:content-transfer-encoding :content-disposition:mime-version:list-unsubscribe:list-subscribe :list-id:precedence:subject:cc:to:from:references:in-reply-to :dkim-signature; bh=4FviDP8LNcs5OH9XS9Lgt79F5tF9VpTSbqPgMlkIJeQ=; fh=vnkzdYJkOIHzReFs+rk+UfPEeI2iP1WJ98UAdqyTvcA=; b=vW/pIfkv3ZSrozk5f4JoBI3eB6TMhX07AUUGFAN7Bv2J6RRRwm0ZYq/8bM8pZIOuMp eG2GaH77nKwTaD4Gk5cXZoHzwenbL299vp7GMZDMftsH4oCkNs3iJt/bq2fwAJwN7lva zK1EsIfMdTiuVwyTg4SfenBw5ewXG+/R6X4PSD3ggyflwVyTbjCVkZcTkZPSbLTKfKrp fYqtwxrtfPFZI24Z+iIJiGllYabSZqZVqIHKj6IU7WCrCVxfYvSpbvhhwHvPAjhaP/0B bFMry5sgJQkNVnSAWtkIsRbogGTG/yK27DLQgS2XB/gGvgLuOaMpPHFcnTLzTQC4EIxf U0uw==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=fail (test mode) header.i=@armlinux.org.uk header.s=pandora-2019 header.b=a8T9+UFZ; arc=pass (i=1 dkim=pass dkdomain=armlinux.org.uk dmarc=pass fromdomain=armlinux.org.uk); spf=pass (google.com: domain of linux-kernel+bounces-46827-ouuuleilei=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-46827-ouuuleilei=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=armlinux.org.uk X-Forwarded-Encrypted: i=1; AJvYcCVX+ccYyQ4vyIF2y9giXgK9uk6/pX1H5Rc9izv6vcru8XsWLZ9CyvS33uDX+pIgPW6jx/0FEv9ibLY13sPB6aj9tfx+Yg== Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [147.75.80.249]) by mx.google.com with ESMTPS id zn6-20020a170906ff0600b00a2b61bd354fsi5746283ejb.283.2024.01.31.08.50.52 for <ouuuleilei@gmail.com> (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 31 Jan 2024 08:50:52 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-46827-ouuuleilei=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) client-ip=147.75.80.249; Authentication-Results: mx.google.com; dkim=fail (test mode) header.i=@armlinux.org.uk header.s=pandora-2019 header.b=a8T9+UFZ; arc=pass (i=1 dkim=pass dkdomain=armlinux.org.uk dmarc=pass fromdomain=armlinux.org.uk); spf=pass (google.com: domain of linux-kernel+bounces-46827-ouuuleilei=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-46827-ouuuleilei=gmail.com@vger.kernel.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=armlinux.org.uk Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 132471F21D09 for <ouuuleilei@gmail.com>; Wed, 31 Jan 2024 16:50:52 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 1305312CD9C; Wed, 31 Jan 2024 16:50:02 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=fail reason="signature verification failed" (2048-bit key) header.d=armlinux.org.uk header.i=@armlinux.org.uk header.b="a8T9+UFZ" Received: from pandora.armlinux.org.uk (pandora.armlinux.org.uk [78.32.30.218]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 947BE12C536; Wed, 31 Jan 2024 16:49:56 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=78.32.30.218 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706719799; cv=none; b=jRWhM/5xumzIQ/Y6tX3slI2cX73Mwe8iikgB37TG9h/4MVmD573GbDDYN2B3Vvv4aMMHG17N34ZRxUBE3+Fyl+VavPYF2DRjkEvsx7nV9Q2P567xaJp2Al+wbgzQ3wh6/gKY2APdlmgxy4LRvGgJlnmqw5wCT2ZRT2e6id7MFjY= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706719799; c=relaxed/simple; bh=oglPI5j4SalFovZELHhLOqqFlAFobwCTFUVvb9MFaqQ=; h=In-Reply-To:References:From:To:Cc:Subject:MIME-Version: Content-Disposition:Content-Type:Message-Id:Date; b=FmVzCQ/0GI7ClN2NYUMN0Yiqrit1sSvbKN/Hu8LDRjng36Nko9carSiLUqV9KOpZjnadWATRoUHG+bAwW3e0HZsFee2o47S0CRKvh3C9eif6Z6KeJeL66Bu5HjgeaL+vD57GwOBtzayxotvEVBdmu3kaH6LcPKCu0142JLBgpAg= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=armlinux.org.uk; spf=none smtp.mailfrom=armlinux.org.uk; dkim=pass (2048-bit key) header.d=armlinux.org.uk header.i=@armlinux.org.uk header.b=a8T9+UFZ; arc=none smtp.client-ip=78.32.30.218 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=armlinux.org.uk Authentication-Results: smtp.subspace.kernel.org; spf=none smtp.mailfrom=armlinux.org.uk DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=armlinux.org.uk; s=pandora-2019; h=Date:Sender:Message-Id:Content-Type: Content-Transfer-Encoding:MIME-Version:Subject:Cc:To:From:References: In-Reply-To:Reply-To:Content-ID:Content-Description:Resent-Date:Resent-From: Resent-Sender:Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help: List-Unsubscribe:List-Subscribe:List-Post:List-Owner:List-Archive; bh=4FviDP8LNcs5OH9XS9Lgt79F5tF9VpTSbqPgMlkIJeQ=; b=a8T9+UFZh5Erxcl19hYu8LnPoN l8jkJTCH5Aga8DZg+RD7ezbHyq4OQdeKkSD6hzIRhev0EyFHx6P+dy6aq64Behg6QOGcwrXp31PcU 6AfPX8Jas4MhwqjfgDzG/9B7RObj1s/1widKHmpuiUi0IxczEPWs+On8ceu+z7LWHVipSzrH9j+MH aVvsx/9nVkddyN/EWYjMHlRk9wKElTZSmbUv2SfQdepcYbj+SFzv+s4g5BVbd1CpZAAuNL4pp1eWO 8kBc76pSd4xlVd4TjgTaVkulutmldzJsN1KRBnCwIlrQ+A3b+5ptlUeZJLMI+uVTpzpitQ9YbzG+U gmLrm9EQ==; Received: from e0022681537dd.dyn.armlinux.org.uk ([fd8f:7570:feb6:1:222:68ff:fe15:37dd]:36844 helo=rmk-PC.armlinux.org.uk) by pandora.armlinux.org.uk with esmtpsa (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from <rmk@armlinux.org.uk>) id 1rVDmV-0003Tf-2S; Wed, 31 Jan 2024 16:49:47 +0000 Received: from rmk by rmk-PC.armlinux.org.uk with local (Exim 4.94.2) (envelope-from <rmk@rmk-PC.armlinux.org.uk>) id 1rVDmU-0027YP-Jz; Wed, 31 Jan 2024 16:49:46 +0000 In-Reply-To: <Zbp5xzmFhKDAgHws@shell.armlinux.org.uk> References: <Zbp5xzmFhKDAgHws@shell.armlinux.org.uk> From: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> To: linux-pm@vger.kernel.org, loongarch@lists.linux.dev, linux-acpi@vger.kernel.org, linux-arch@vger.kernel.org, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-riscv@lists.infradead.org, kvmarm@lists.linux.dev, x86@kernel.org, acpica-devel@lists.linuxfoundation.org, linux-csky@vger.kernel.org, linux-doc@vger.kernel.org, linux-ia64@vger.kernel.org, linux-parisc@vger.kernel.org Cc: Salil Mehta <salil.mehta@huawei.com>, Jean-Philippe Brucker <jean-philippe@linaro.org>, jianyong.wu@arm.com, justin.he@arm.com, James Morse <james.morse@arm.com>, Jonathan Cameron <Jonathan.Cameron@Huawei.com>, "Rafael J. Wysocki" <rafael@kernel.org> Subject: [PATCH RFC v4 02/15] ACPI: processor: Register all CPUs from acpi_processor_get_info() Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: <linux-kernel.vger.kernel.org> List-Subscribe: <mailto:linux-kernel+subscribe@vger.kernel.org> List-Unsubscribe: <mailto:linux-kernel+unsubscribe@vger.kernel.org> MIME-Version: 1.0 Content-Disposition: inline Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset="utf-8" Message-Id: <E1rVDmU-0027YP-Jz@rmk-PC.armlinux.org.uk> Sender: Russell King <rmk@armlinux.org.uk> Date: Wed, 31 Jan 2024 16:49:46 +0000 X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1789625475734081625 X-GMAIL-MSGID: 1789625475734081625 |
Series |
[RFC,v4,01/15] ACPI: Only enumerate enabled (or functional) processor devices
|
|
Commit Message
Russell King (Oracle)
Jan. 31, 2024, 4:49 p.m. UTC
From: James Morse <james.morse@arm.com> To allow ACPI to skip the call to arch_register_cpu() when the _STA value indicates the CPU can't be brought online right now, move the arch_register_cpu() call into acpi_processor_get_info(). Systems can still be booted with 'acpi=off', or not include an ACPI description at all. For these, the CPUs continue to be registered by cpu_dev_register_generic(). This moves the CPU register logic back to a subsys_initcall(), while the memory nodes will have been registered earlier. Signed-off-by: James Morse <james.morse@arm.com> Reviewed-by: Gavin Shan <gshan@redhat.com> Tested-by: Miguel Luis <miguel.luis@oracle.com> Tested-by: Vishnu Pajjuri <vishnu@os.amperecomputing.com> Tested-by: Jianyong Wu <jianyong.wu@arm.com> Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> --- Changes since RFC v2: * Fixup comment in acpi_processor_get_info() (Gavin Shan) * Add comment in cpu_dev_register_generic() (Gavin Shan) --- drivers/acpi/acpi_processor.c | 12 ++++++++++++ drivers/base/cpu.c | 6 +++++- 2 files changed, 17 insertions(+), 1 deletion(-)
Comments
On Wed, Jan 31, 2024 at 5:50 PM Russell King <rmk+kernel@armlinux.org.uk> wrote: > > From: James Morse <james.morse@arm.com> > > To allow ACPI to skip the call to arch_register_cpu() when the _STA > value indicates the CPU can't be brought online right now, move the > arch_register_cpu() call into acpi_processor_get_info(). > > Systems can still be booted with 'acpi=off', or not include an > ACPI description at all. For these, the CPUs continue to be > registered by cpu_dev_register_generic(). > > This moves the CPU register logic back to a subsys_initcall(), > while the memory nodes will have been registered earlier. > > Signed-off-by: James Morse <james.morse@arm.com> > Reviewed-by: Gavin Shan <gshan@redhat.com> > Tested-by: Miguel Luis <miguel.luis@oracle.com> > Tested-by: Vishnu Pajjuri <vishnu@os.amperecomputing.com> > Tested-by: Jianyong Wu <jianyong.wu@arm.com> > Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> > Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> > --- > Changes since RFC v2: > * Fixup comment in acpi_processor_get_info() (Gavin Shan) > * Add comment in cpu_dev_register_generic() (Gavin Shan) > --- > drivers/acpi/acpi_processor.c | 12 ++++++++++++ > drivers/base/cpu.c | 6 +++++- > 2 files changed, 17 insertions(+), 1 deletion(-) > > diff --git a/drivers/acpi/acpi_processor.c b/drivers/acpi/acpi_processor.c > index cf7c1cca69dd..a68c475cdea5 100644 > --- a/drivers/acpi/acpi_processor.c > +++ b/drivers/acpi/acpi_processor.c > @@ -314,6 +314,18 @@ static int acpi_processor_get_info(struct acpi_device *device) > cpufreq_add_device("acpi-cpufreq"); > } > > + /* > + * Register CPUs that are present. get_cpu_device() is used to skip > + * duplicate CPU descriptions from firmware. > + */ > + if (!invalid_logical_cpuid(pr->id) && cpu_present(pr->id) && > + !get_cpu_device(pr->id)) { > + int ret = arch_register_cpu(pr->id); > + > + if (ret) > + return ret; > + } > + > /* > * Extra Processor objects may be enumerated on MP systems with > * less than the max # of CPUs. They should be ignored _iff This is interesting, because right below there is the following code: if (invalid_logical_cpuid(pr->id) || !cpu_present(pr->id)) { int ret = acpi_processor_hotadd_init(pr); if (ret) return ret; } and acpi_processor_hotadd_init() essentially calls arch_register_cpu() with some extra things around it (more about that below). I do realize that acpi_processor_hotadd_init() is defined under CONFIG_ACPI_HOTPLUG_CPU, so for the sake of the argument let's consider an architecture where CONFIG_ACPI_HOTPLUG_CPU is set. So why are the two conditionals that almost contradict each other both needed? It looks like the new code could be combined with acpi_processor_hotadd_init() to do the right thing in all cases. Now, acpi_processor_hotadd_init() does some extra things that look like they should be done by the new code too. 1. It checks invalid_phys_cpuid() which appears to be a good idea to me. 2. It uses locking around arch_register_cpu() which doesn't seem unreasonable either. 3. It calls acpi_map_cpu() and I'm not sure why this is not done by the new code. The only thing that can be dropped from it is the _STA check AFAICS, because acpi_processor_add() won't even be called if the CPU is not present (and not enabled after the first patch). So why does the code not do 1 - 3 above? > diff --git a/drivers/base/cpu.c b/drivers/base/cpu.c > index 47de0f140ba6..13d052bf13f4 100644 > --- a/drivers/base/cpu.c > +++ b/drivers/base/cpu.c > @@ -553,7 +553,11 @@ static void __init cpu_dev_register_generic(void) > { > int i, ret; > > - if (!IS_ENABLED(CONFIG_GENERIC_CPU_DEVICES)) > + /* > + * When ACPI is enabled, CPUs are registered via > + * acpi_processor_get_info(). > + */ > + if (!IS_ENABLED(CONFIG_GENERIC_CPU_DEVICES) || !acpi_disabled) > return; Honestly, this looks like a quick hack to me and it absolutely requires an ACK from the x86 maintainers to go anywhere. > > for_each_present_cpu(i) { > --
On Thu, Feb 15, 2024 at 08:22:29PM +0100, Rafael J. Wysocki wrote: > On Wed, Jan 31, 2024 at 5:50 PM Russell King <rmk+kernel@armlinux.org.uk> wrote: > > diff --git a/drivers/acpi/acpi_processor.c b/drivers/acpi/acpi_processor.c > > index cf7c1cca69dd..a68c475cdea5 100644 > > --- a/drivers/acpi/acpi_processor.c > > +++ b/drivers/acpi/acpi_processor.c > > @@ -314,6 +314,18 @@ static int acpi_processor_get_info(struct acpi_device *device) > > cpufreq_add_device("acpi-cpufreq"); > > } > > > > + /* > > + * Register CPUs that are present. get_cpu_device() is used to skip > > + * duplicate CPU descriptions from firmware. > > + */ > > + if (!invalid_logical_cpuid(pr->id) && cpu_present(pr->id) && > > + !get_cpu_device(pr->id)) { > > + int ret = arch_register_cpu(pr->id); > > + > > + if (ret) > > + return ret; > > + } > > + > > /* > > * Extra Processor objects may be enumerated on MP systems with > > * less than the max # of CPUs. They should be ignored _iff > > This is interesting, because right below there is the following code: > > if (invalid_logical_cpuid(pr->id) || !cpu_present(pr->id)) { > int ret = acpi_processor_hotadd_init(pr); > > if (ret) > return ret; > } > > and acpi_processor_hotadd_init() essentially calls arch_register_cpu() > with some extra things around it (more about that below). > > I do realize that acpi_processor_hotadd_init() is defined under > CONFIG_ACPI_HOTPLUG_CPU, so for the sake of the argument let's > consider an architecture where CONFIG_ACPI_HOTPLUG_CPU is set. > > So why are the two conditionals that almost contradict each other both > needed? It looks like the new code could be combined with > acpi_processor_hotadd_init() to do the right thing in all cases. > > Now, acpi_processor_hotadd_init() does some extra things that look > like they should be done by the new code too. > > 1. It checks invalid_phys_cpuid() which appears to be a good idea to me. > > 2. It uses locking around arch_register_cpu() which doesn't seem > unreasonable either. > > 3. It calls acpi_map_cpu() and I'm not sure why this is not done by > the new code. > > The only thing that can be dropped from it is the _STA check AFAICS, > because acpi_processor_add() won't even be called if the CPU is not > present (and not enabled after the first patch). > > So why does the code not do 1 - 3 above? Honestly, I'm out of my depth with this and can't answer your questions - and I really don't want to try fiddling with this code because it's just too icky (even in its current form in mainline) to be understandable to anyone who hasn't gained a detailed knowledge of this code. It's going to require a lot of analysis - how acpi_map_cpuid() behaves in all circumstances, what this means for invalid_logical_cpuid() and invalid_phys_cpuid(), what paths will be taken in each case. This code is already just too hairy for someone who isn't an experienced ACPI hacker to be able to follow and I don't see an obvious way to make it more readable. James' additions make it even more complex and less readable.
On Tue, Feb 20, 2024 at 8:59 PM Rafael J. Wysocki <rafael@kernel.org> wrote: > > On Tue, Feb 20, 2024 at 5:24 PM Jonathan Cameron > <Jonathan.Cameron@huawei.com> wrote: > > > > On Tue, 20 Feb 2024 15:13:58 +0000 > > "Russell King (Oracle)" <linux@armlinux.org.uk> wrote: > > > > > On Tue, Feb 20, 2024 at 11:27:15AM +0000, Russell King (Oracle) wrote: > > > > On Thu, Feb 15, 2024 at 08:22:29PM +0100, Rafael J. Wysocki wrote: > > > > > On Wed, Jan 31, 2024 at 5:50 PM Russell King <rmk+kernel@armlinux.org.uk> wrote: > > > > > > diff --git a/drivers/acpi/acpi_processor.c b/drivers/acpi/acpi_processor.c > > > > > > index cf7c1cca69dd..a68c475cdea5 100644 > > > > > > --- a/drivers/acpi/acpi_processor.c > > > > > > +++ b/drivers/acpi/acpi_processor.c > > > > > > @@ -314,6 +314,18 @@ static int acpi_processor_get_info(struct acpi_device *device) > > > > > > cpufreq_add_device("acpi-cpufreq"); > > > > > > } > > > > > > > > > > > > + /* > > > > > > + * Register CPUs that are present. get_cpu_device() is used to skip > > > > > > + * duplicate CPU descriptions from firmware. > > > > > > + */ > > > > > > + if (!invalid_logical_cpuid(pr->id) && cpu_present(pr->id) && > > > > > > + !get_cpu_device(pr->id)) { > > > > > > + int ret = arch_register_cpu(pr->id); > > > > > > + > > > > > > + if (ret) > > > > > > + return ret; > > > > > > + } > > > > > > + > > > > > > /* > > > > > > * Extra Processor objects may be enumerated on MP systems with > > > > > > * less than the max # of CPUs. They should be ignored _iff > > > > > > > > > > This is interesting, because right below there is the following code: > > > > > > > > > > if (invalid_logical_cpuid(pr->id) || !cpu_present(pr->id)) { > > > > > int ret = acpi_processor_hotadd_init(pr); > > > > > > > > > > if (ret) > > > > > return ret; > > > > > } > > > > > > > > > > and acpi_processor_hotadd_init() essentially calls arch_register_cpu() > > > > > with some extra things around it (more about that below). > > > > > > > > > > I do realize that acpi_processor_hotadd_init() is defined under > > > > > CONFIG_ACPI_HOTPLUG_CPU, so for the sake of the argument let's > > > > > consider an architecture where CONFIG_ACPI_HOTPLUG_CPU is set. > > > > > > > > > > So why are the two conditionals that almost contradict each other both > > > > > needed? It looks like the new code could be combined with > > > > > acpi_processor_hotadd_init() to do the right thing in all cases. > > > > > > > > > > Now, acpi_processor_hotadd_init() does some extra things that look > > > > > like they should be done by the new code too. > > > > > > > > > > 1. It checks invalid_phys_cpuid() which appears to be a good idea to me. > > > > > > > > > > 2. It uses locking around arch_register_cpu() which doesn't seem > > > > > unreasonable either. > > > > > > > > > > 3. It calls acpi_map_cpu() and I'm not sure why this is not done by > > > > > the new code. > > > > > > > > > > The only thing that can be dropped from it is the _STA check AFAICS, > > > > > because acpi_processor_add() won't even be called if the CPU is not > > > > > present (and not enabled after the first patch). > > > > > > > > > > So why does the code not do 1 - 3 above? > > > > > > > > Honestly, I'm out of my depth with this and can't answer your > > > > questions - and I really don't want to try fiddling with this code > > > > because it's just too icky (even in its current form in mainline) > > > > to be understandable to anyone who hasn't gained a detailed knowledge > > > > of this code. > > > > > > > > It's going to require a lot of analysis - how acpi_map_cpuid() behaves > > > > in all circumstances, what this means for invalid_logical_cpuid() and > > > > invalid_phys_cpuid(), what paths will be taken in each case. This code > > > > is already just too hairy for someone who isn't an experienced ACPI > > > > hacker to be able to follow and I don't see an obvious way to make it > > > > more readable. > > > > > > > > James' additions make it even more complex and less readable. > > > > > > As an illustration of the problems I'm having here, I was just writing > > > a reply to this with a suggestion of transforming this code ultimately > > > to: > > > > > > if (!get_cpu_device(pr->id)) { > > > int ret; > > > > > > if (!invalid_logical_cpuid(pr->id) && cpu_present(pr->id)) > > > ret = acpi_processor_make_enabled(pr); > > > else > > > ret = acpi_processor_make_present(pr); > > > > > > if (ret) > > > return ret; > > > } > > > > > > (acpi_processor_make_present() would be acpi_processor_hotadd_init() > > > and acpi_processor_make_enabled() would be arch_register_cpu() at this > > > point.) > > > > > > Then I realised that's a bad idea - because we really need to check > > > that pr->id is valid before calling get_cpu_device() on it, so this > > > won't work. That leaves us with: > > > > > > int ret; > > > > > > if (invalid_logical_cpuid(pr->id) || !cpu_present(pr->id)) { > > > /* x86 et.al. path */ > > > ret = acpi_processor_make_present(pr); > > > } else if (!get_cpu_device(pr->id)) { > > > /* Arm64 path */ > > > ret = acpi_processor_make_enabled(pr); > > > } else { > > > ret = 0; > > > } > > > > > > if (ret) > > > return ret; > > > > > > Now, the next transformation would be to move !get_cpu_device(pr->id) > > > into acpi_processor_make_enabled() which would eliminate one of those > > > if() legs. > > > > > > Now, if we want to somehow make the call to arch_regster_cpu() common > > > in these two paths, the next question is what are the _precise_ > > > semantics of acpi_map_cpu(), particularly with respect to it > > > modifying pr->id. Is it guaranteed to always give the same result > > > for the same processor described in ACPI? What acpi_map_cpu() anyway, > > > I can find no documentation for it. > > > > > > Then there's the question whether calling acpi_unmap_cpu() should be > > > done on the failure path if arch_register_cpu() fails, which is done > > > for the x86 path but not the Arm64 path. Should it be done for the > > > Arm64 path? I've no idea, but as Arm64 doesn't implement either of > > > these two functions, I guess they could be stubbed out and thus be > > > no-ops - but then we open a hole where if pr->id is invalid, we > > > end up passing that invalid value to arch_register_cpu() which I'm > > > quite sure will explode with a negative CPU number. > > > > > > So, to my mind, what you're effectively asking for is a total rewrite > > > of all the code in and called by acpi_processor_get_info()... and that > > > is not something I am willing to do (because it's too far outside of > > > my knowledge area.) > > > > > > As I said in my reply to patch 1, I think your comments on patch 2 > > > make Arm64 vcpu hotplug unachievable in a reasonable time frame, and > > > certainly outside the bounds of what I can do to progress this. > > > > > > So, at this point I'm going to stand down from further participation > > > with this patch set as I believe I've reached the limit of what I can > > > do to progress it. > > > > > > > Thanks for your hard work on this Russell - we have moved forwards. > > > > Short of anyone else stepping up I'll pick this up with > > the help of some my colleagues. As such I'm keen on getting patch > > 1 upstream ASAP so that we can exclude the need for some of the > > other workarounds from earlier versions of this series (the ones > > dropped before now). > > Applied (as 6.9 material). And I'm going to drop it, because it is not correct. The problem is that it is going to affect non-processor devices, but let me comment on that patch itself.
diff --git a/drivers/acpi/acpi_processor.c b/drivers/acpi/acpi_processor.c index cf7c1cca69dd..a68c475cdea5 100644 --- a/drivers/acpi/acpi_processor.c +++ b/drivers/acpi/acpi_processor.c @@ -314,6 +314,18 @@ static int acpi_processor_get_info(struct acpi_device *device) cpufreq_add_device("acpi-cpufreq"); } + /* + * Register CPUs that are present. get_cpu_device() is used to skip + * duplicate CPU descriptions from firmware. + */ + if (!invalid_logical_cpuid(pr->id) && cpu_present(pr->id) && + !get_cpu_device(pr->id)) { + int ret = arch_register_cpu(pr->id); + + if (ret) + return ret; + } + /* * Extra Processor objects may be enumerated on MP systems with * less than the max # of CPUs. They should be ignored _iff diff --git a/drivers/base/cpu.c b/drivers/base/cpu.c index 47de0f140ba6..13d052bf13f4 100644 --- a/drivers/base/cpu.c +++ b/drivers/base/cpu.c @@ -553,7 +553,11 @@ static void __init cpu_dev_register_generic(void) { int i, ret; - if (!IS_ENABLED(CONFIG_GENERIC_CPU_DEVICES)) + /* + * When ACPI is enabled, CPUs are registered via + * acpi_processor_get_info(). + */ + if (!IS_ENABLED(CONFIG_GENERIC_CPU_DEVICES) || !acpi_disabled) return; for_each_present_cpu(i) {