Message ID | 20240201031950.3225626-1-maobibo@loongson.cn |
---|---|
Headers |
Return-Path: <linux-kernel+bounces-47564-ouuuleilei=gmail.com@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a05:693c:2685:b0:106:209c:c626 with SMTP id mn5csp180756dyc; Wed, 31 Jan 2024 19:21:01 -0800 (PST) X-Google-Smtp-Source: AGHT+IHhR3tWjGr5/0L4j/0Xd3cRjq3YFzAR6F40FiUvrWsSaeJJkB9e1G21ww99r84ntmJ8NBFO X-Received: by 2002:a05:651c:109:b0:2cf:1dc1:7faa with SMTP id a9-20020a05651c010900b002cf1dc17faamr2115912ljb.16.1706757661667; Wed, 31 Jan 2024 19:21:01 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1706757661; cv=pass; d=google.com; s=arc-20160816; b=Ricj2x5bUQ22LD8SRCWb8C00NNk5eeYnHsi8L5CiM80rMycreHrAwMbrO1YnJhGGs+ wErgQ/1aregN7mG4HSKdtx5wjqVlRQQ+9/HxJ6TQtXsUFKqqq4Yw5QmWL4IKuD8x6EF9 rVfq3S5GSpUrswmTO5i/6PSc5bljPGVSq70BgpPwu6c9UDiHbm6EW1Lay08VOWkDLPSj JDOA3pBizxzNUngRvDEKmt4s5LpE7284wM2n19utdcGmRNOaoicjxehmHFqK7Ll5hGOj AaL8YPiUNHs6Y9biySyXDVpgkMUvRwfl06f5BFsw0pADni55RzBE3Uib5ys9dqq4ebee sAHA== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:list-unsubscribe :list-subscribe:list-id:precedence:message-id:date:subject:cc:to :from; bh=dCBn9lzN3pwftwtS/3P6lEp7st1J/DRzGCc6tqhwRK0=; fh=7lMESJgNqA+QgnZhL2ZMK9gnlEV9+ZI1NSMGUG5cdco=; b=TKabLgAdvbrkTL0XYyoU5jj4uxSpIcU26+DWfiXZ7TF2bD50Z71hey04p3HepE1vd/ u9MQ4x7nqlMN0NsqoMeHiHY36uGOt7EXgYJnl6cNCU1f1vTOjV8urLOwhEOKjgAjPUKD 05AVj8nmwiYuqgy8+LX6oL2pSAeyE0VZGXgJBBeNBqC+5wPGao/ouaXoHZ+xNR8HM5ze sRaLJcBcVaE7EPK2FQMQEtjVuQXci/PmpCjcqBhxNjJybHV6iOK2x1mQt4kkA/qARW9m hM5P5+fe6JTMOSGddQM1jTWjJ1yFnIv7MeWT2tgJarumt7rS6R7+n2f74uG65cJ7cLrI 3O4A==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; arc=pass (i=1 spf=pass spfdomain=loongson.cn); spf=pass (google.com: domain of linux-kernel+bounces-47564-ouuuleilei=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-47564-ouuuleilei=gmail.com@vger.kernel.org" X-Forwarded-Encrypted: i=1; AJvYcCWwlu5qFrFe0YWfBeR1CwXqn+4TOgU+L62+S5hnMVI1KowTGm6y4GcSKkn4l0U1ncnbnvOvbEXx+dxgjWrYuKePGkvI6A== Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [2604:1380:4601:e00::3]) by mx.google.com with ESMTPS id x6-20020a056402414600b0055f81b7d1ccsi1461516eda.394.2024.01.31.19.21.01 for <ouuuleilei@gmail.com> (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 31 Jan 2024 19:21:01 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-47564-ouuuleilei=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) client-ip=2604:1380:4601:e00::3; Authentication-Results: mx.google.com; arc=pass (i=1 spf=pass spfdomain=loongson.cn); spf=pass (google.com: domain of linux-kernel+bounces-47564-ouuuleilei=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-47564-ouuuleilei=gmail.com@vger.kernel.org" Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 1DB5D1F2BF3F for <ouuuleilei@gmail.com>; Thu, 1 Feb 2024 03:21:01 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 354853D0A6; Thu, 1 Feb 2024 03:19:58 +0000 (UTC) Received: from mail.loongson.cn (mail.loongson.cn [114.242.206.163]) by smtp.subspace.kernel.org (Postfix) with ESMTP id A0A553B293; Thu, 1 Feb 2024 03:19:53 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=114.242.206.163 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706757596; cv=none; b=rTtTXskSoK1wTiawo4L35UUBB6evgw6ooHN4fhebOUEAjUwF93vWa43fmlxUPDJ2cyqKwtxeFDuDLa6/lRP/68mhmivZqZvh7H3hrf4dW+xwxVCyv/4t14kSER30/3+9FlSYuj3z+pYlN2C2y+VSC4d8eNiXD4m1BD5+vK0L5kE= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1706757596; c=relaxed/simple; bh=+9YB0WxuNQOB7N11Y9bprkm23A0whVNmF4nZDYquEGw=; h=From:To:Cc:Subject:Date:Message-Id:MIME-Version; b=Fvaqg4XrAoA/q+KqbPiqDKqYAF5l8WnCb39MQ5LYzgw8SjRm3e3ytWymEC8uit07qWPci7sfT1Obj77TGzXuCmWQvMt0wwXHeSWcF7jiDh/Vw2pTNzzoyQPLLfEyKEIefY3rVYe54gnJrcJqVFOy1kAkbLwdrrlB/8ywswQVuPE= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=loongson.cn; spf=pass smtp.mailfrom=loongson.cn; arc=none smtp.client-ip=114.242.206.163 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=loongson.cn Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=loongson.cn Received: from loongson.cn (unknown [10.2.5.213]) by gateway (Coremail) with SMTP id _____8CxifDXDbtlLFQJAA--.27670S3; Thu, 01 Feb 2024 11:19:51 +0800 (CST) Received: from localhost.localdomain (unknown [10.2.5.213]) by localhost.localdomain (Coremail) with SMTP id AQAAf8DxfRPWDbtltkIrAA--.3273S2; Thu, 01 Feb 2024 11:19:50 +0800 (CST) From: Bibo Mao <maobibo@loongson.cn> To: Huacai Chen <chenhuacai@kernel.org>, Tianrui Zhao <zhaotianrui@loongson.cn>, Juergen Gross <jgross@suse.com>, Paolo Bonzini <pbonzini@redhat.com> Cc: loongarch@lists.linux.dev, linux-kernel@vger.kernel.org, virtualization@lists.linux.dev, kvm@vger.kernel.org Subject: [PATCH v4 0/6] LoongArch: Add pv ipi support on LoongArch VM Date: Thu, 1 Feb 2024 11:19:44 +0800 Message-Id: <20240201031950.3225626-1-maobibo@loongson.cn> X-Mailer: git-send-email 2.39.3 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: <linux-kernel.vger.kernel.org> List-Subscribe: <mailto:linux-kernel+subscribe@vger.kernel.org> List-Unsubscribe: <mailto:linux-kernel+unsubscribe@vger.kernel.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-CM-TRANSID: AQAAf8DxfRPWDbtltkIrAA--.3273S2 X-CM-SenderInfo: xpdruxter6z05rqj20fqof0/ X-Coremail-Antispam: 1Uk129KBj93XoWxCw47WF1kWF1kuFyUGrW8Zrc_yoWrXFW5pF W7urn5WFs5Gr93Zwnxt3s3ur15Jw1xG34aq3W2yrW8C3y2qFyUXr4kGr98Za4kJw4rJrW0 qF1rGw1YgF1UA3XCm3ZEXasCq-sJn29KB7ZKAUJUUUUr529EdanIXcx71UUUUU7KY7ZEXa sCq-sGcSsGvfJ3Ic02F40EFcxC0VAKzVAqx4xG6I80ebIjqfuFe4nvWSU5nxnvy29KBjDU 0xBIdaVrnRJUUU90b4IE77IF4wAFF20E14v26r1j6r4UM7CY07I20VC2zVCF04k26cxKx2 IYs7xG6rWj6s0DM7CIcVAFz4kK6r1Y6r17M28lY4IEw2IIxxk0rwA2F7IY1VAKz4vEj48v e4kI8wA2z4x0Y4vE2Ix0cI8IcVAFwI0_Gr0_Xr1l84ACjcxK6xIIjxv20xvEc7CjxVAFwI 0_Gr0_Cr1l84ACjcxK6I8E87Iv67AKxVWxJVW8Jr1l84ACjcxK6I8E87Iv6xkF7I0E14v2 6r4UJVWxJr1ln4kS14v26r1Y6r17M2AIxVAIcxkEcVAq07x20xvEncxIr21l57IF6xkI12 xvs2x26I8E6xACxx1l5I8CrVACY4xI64kE6c02F40Ex7xfMcIj6xIIjxv20xvE14v26r1Y 6r17McIj6I8E87Iv67AKxVWUJVW8JwAm72CE4IkC6x0Yz7v_Jr0_Gr1lF7xvr2IYc2Ij64 vIr41l42xK82IYc2Ij64vIr41l4I8I3I0E4IkC6x0Yz7v_Jr0_Gr1l4IxYO2xFxVAFwI0_ Jrv_JF1lx2IqxVAqx4xG67AKxVWUJVWUGwC20s026x8GjcxK67AKxVWUGVWUWwC2zVAF1V AY17CE14v26r1q6r43MIIYrxkI7VAKI48JMIIF0xvE2Ix0cI8IcVAFwI0_Jr0_JF4lIxAI cVC0I7IYx2IY6xkF7I0E14v26r1j6r4UMIIF0xvE42xK8VAvwI8IcIk0rVWUJVWUCwCI42 IY6I8E87Iv67AKxVWUJVW8JwCI42IY6I8E87Iv6xkF7I0E14v26r1j6r4UYxBIdaVFxhVj vjDU0xZFpf9x07jepB-UUUUU= X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1789665121817733366 X-GMAIL-MSGID: 1789665121817733366 |
Series |
LoongArch: Add pv ipi support on LoongArch VM
|
|
Message
maobibo
Feb. 1, 2024, 3:19 a.m. UTC
This patchset adds pv ipi support for VM. On physical machine, ipi HW uses IOCSR registers, however there is trap into hypervisor when vcpu accesses IOCSR registers if system is in VM mode. SWI is a interrupt mechanism like SGI on ARM, software can send interrupt to CPU, only that on LoongArch SWI can only be sent to local CPU now. So SWI can not used for IPI on real HW system, however it can be used on VM when combined with hypercall method. This patch uses SWI interrupt for IPI mechanism, SWI injection uses hypercall method. And there is one trap with IPI sending, however with IPI receiving there is no trap. with IOCSR HW ipi method, there will be two trap into hypervisor with ipi receiving. Also this patch adds IPI multicast support for VM, this idea comes from x86 pv ipi. IPI can be sent to 128 vcpus in one time. Here is the microbenchmarck data with perf bench futex wake case on 3C5000 single-way machine, there are 16 cpus on 3C5000 single-way machine, VM has 16 vcpus also. The benchmark data is ms time unit to wakeup 16 threads, the performance is higher if data is smaller. perf bench futex wake, Wokeup 16 of 16 threads in ms --physical machine-- --VM original-- --VM with pv ipi patch-- 0.0176 ms 0.1140 ms 0.0481 ms --- Change in V4: 1. Modfiy pv ipi hook function name call_func_ipi() and call_func_single_ipi() with send_ipi_mask()/send_ipi_single(), since pv ipi is used for both remote function call and reschedule notification. 2. Refresh changelog. Change in V3: 1. Add 128 vcpu ipi multicast support like x86 2. Change cpucfg base address from 0x10000000 to 0x40000000, in order to avoid confliction with future hw usage 3. Adjust patch order in this patchset, move patch Refine-ipi-ops-on-LoongArch-platform to the first one. Change in V2: 1. Add hw cpuid map support since ipi routing uses hw cpuid 2. Refine changelog description 3. Add hypercall statistic support for vcpu 4. Set percpu pv ipi message buffer aligned with cacheline 5. Refine pv ipi send logic, do not send ipi message with if there is pending ipi message. --- Bibo Mao (6): LoongArch/smp: Refine ipi ops on LoongArch platform LoongArch: KVM: Add hypercall instruction emulation support LoongArch: KVM: Add cpucfg area for kvm hypervisor LoongArch: Add paravirt interface for guest kernel LoongArch: KVM: Add vcpu search support from physical cpuid LoongArch: Add pv ipi support on LoongArch system arch/loongarch/Kconfig | 9 + arch/loongarch/include/asm/Kbuild | 1 - arch/loongarch/include/asm/hardirq.h | 5 + arch/loongarch/include/asm/inst.h | 1 + arch/loongarch/include/asm/irq.h | 10 +- arch/loongarch/include/asm/kvm_host.h | 27 +++ arch/loongarch/include/asm/kvm_para.h | 157 ++++++++++++++++++ arch/loongarch/include/asm/kvm_vcpu.h | 1 + arch/loongarch/include/asm/loongarch.h | 11 ++ arch/loongarch/include/asm/paravirt.h | 27 +++ .../include/asm/paravirt_api_clock.h | 1 + arch/loongarch/include/asm/smp.h | 31 ++-- arch/loongarch/include/uapi/asm/Kbuild | 2 - arch/loongarch/kernel/Makefile | 1 + arch/loongarch/kernel/irq.c | 24 +-- arch/loongarch/kernel/paravirt.c | 154 +++++++++++++++++ arch/loongarch/kernel/perf_event.c | 14 +- arch/loongarch/kernel/setup.c | 2 + arch/loongarch/kernel/smp.c | 60 ++++--- arch/loongarch/kernel/time.c | 12 +- arch/loongarch/kvm/exit.c | 125 ++++++++++++-- arch/loongarch/kvm/vcpu.c | 94 ++++++++++- arch/loongarch/kvm/vm.c | 11 ++ 23 files changed, 678 insertions(+), 102 deletions(-) create mode 100644 arch/loongarch/include/asm/kvm_para.h create mode 100644 arch/loongarch/include/asm/paravirt.h create mode 100644 arch/loongarch/include/asm/paravirt_api_clock.h delete mode 100644 arch/loongarch/include/uapi/asm/Kbuild create mode 100644 arch/loongarch/kernel/paravirt.c base-commit: 1bbb19b6eb1b8685ab1c268a401ea64380b8bbcb
Comments
Hi, On 2/1/24 11:19, Bibo Mao wrote: > [snip] > > Here is the microbenchmarck data with perf bench futex wake case on 3C5000 > single-way machine, there are 16 cpus on 3C5000 single-way machine, VM > has 16 vcpus also. The benchmark data is ms time unit to wakeup 16 threads, > the performance is higher if data is smaller. > > perf bench futex wake, Wokeup 16 of 16 threads in ms > --physical machine-- --VM original-- --VM with pv ipi patch-- > 0.0176 ms 0.1140 ms 0.0481 ms > > --- > Change in V4: > 1. Modfiy pv ipi hook function name call_func_ipi() and > call_func_single_ipi() with send_ipi_mask()/send_ipi_single(), since pv > ipi is used for both remote function call and reschedule notification. > 2. Refresh changelog. > > Change in V3: > 1. Add 128 vcpu ipi multicast support like x86 > 2. Change cpucfg base address from 0x10000000 to 0x40000000, in order > to avoid confliction with future hw usage > 3. Adjust patch order in this patchset, move patch > Refine-ipi-ops-on-LoongArch-platform to the first one. Sorry for the late reply (and Happy Chinese New Year), and thanks for providing microbenchmark numbers! But it seems the more comprehensive CoreMark results were omitted (that's also absent in v3)? While the changes between v4 and v2 shouldn't be performance-sensitive IMO (I haven't checked carefully though), it could be better to showcase the improvements / non-harmfulness of the changes and make us confident in accepting the changes.
On 2/15/24 18:11, WANG Xuerui wrote: > Sorry for the late reply (and Happy Chinese New Year), and thanks for > providing microbenchmark numbers! But it seems the more comprehensive > CoreMark results were omitted (that's also absent in v3)? While the Of course the benchmark suite should be UnixBench instead of CoreMark. Lesson: don't multi-task code reviews, especially not after consuming beer -- a cup of coffee won't fully cancel the influence. ;-)
On 2024/2/15 下午6:25, WANG Xuerui wrote: > On 2/15/24 18:11, WANG Xuerui wrote: >> Sorry for the late reply (and Happy Chinese New Year), and thanks for >> providing microbenchmark numbers! But it seems the more comprehensive >> CoreMark results were omitted (that's also absent in v3)? While the > > Of course the benchmark suite should be UnixBench instead of CoreMark. > Lesson: don't multi-task code reviews, especially not after consuming > beer -- a cup of coffee won't fully cancel the influence. ;-) > Where is rule about benchmark choices like UnixBench/Coremark for ipi improvement? Regards Bibo Mao
On 2/17/24 11:15, maobibo wrote: > On 2024/2/15 下午6:25, WANG Xuerui wrote: >> On 2/15/24 18:11, WANG Xuerui wrote: >>> Sorry for the late reply (and Happy Chinese New Year), and thanks for >>> providing microbenchmark numbers! But it seems the more comprehensive >>> CoreMark results were omitted (that's also absent in v3)? While the >> >> Of course the benchmark suite should be UnixBench instead of CoreMark. >> Lesson: don't multi-task code reviews, especially not after consuming >> beer -- a cup of coffee won't fully cancel the influence. ;-) >> > Where is rule about benchmark choices like UnixBench/Coremark for ipi > improvement? Sorry for the late reply. The rules are mostly unwritten, but in general you can think of the preference of benchmark suites as a matter of "effectiveness" -- the closer it's to some real workload in the wild, the better. Micro-benchmarks is okay for illustrating the points, but without demonstrating the impact on realistic workloads, a change could be "useless" in practice or even decrease various performance metrics (be that throughput or latency or anything that matters in the certain case), but get accepted without notice.
On 2024/2/22 下午5:34, WANG Xuerui wrote: > On 2/17/24 11:15, maobibo wrote: >> On 2024/2/15 下午6:25, WANG Xuerui wrote: >>> On 2/15/24 18:11, WANG Xuerui wrote: >>>> Sorry for the late reply (and Happy Chinese New Year), and thanks >>>> for providing microbenchmark numbers! But it seems the more >>>> comprehensive CoreMark results were omitted (that's also absent in >>>> v3)? While the >>> >>> Of course the benchmark suite should be UnixBench instead of >>> CoreMark. Lesson: don't multi-task code reviews, especially not after >>> consuming beer -- a cup of coffee won't fully cancel the influence. ;-) >>> >> Where is rule about benchmark choices like UnixBench/Coremark for ipi >> improvement? > > Sorry for the late reply. The rules are mostly unwritten, but in general > you can think of the preference of benchmark suites as a matter of > "effectiveness" -- the closer it's to some real workload in the wild, > the better. Micro-benchmarks is okay for illustrating the points, but > without demonstrating the impact on realistic workloads, a change could > be "useless" in practice or even decrease various performance metrics > (be that throughput or latency or anything that matters in the certain > case), but get accepted without notice. yes, micro-benchmark cannot represent the real world, however it does not mean that UnixBench/Coremark should be run. You need to point out what is the negative effective from code, or what is the possible real scenario which may benefit. And points out the reasonable benchmark sensitive for IPIs rather than blindly saying UnixBench/Coremark. Regards Bibo Mao >
On 2/22/24 18:06, maobibo wrote: > > > On 2024/2/22 下午5:34, WANG Xuerui wrote: >> On 2/17/24 11:15, maobibo wrote: >>> On 2024/2/15 下午6:25, WANG Xuerui wrote: >>>> On 2/15/24 18:11, WANG Xuerui wrote: >>>>> Sorry for the late reply (and Happy Chinese New Year), and thanks >>>>> for providing microbenchmark numbers! But it seems the more >>>>> comprehensive CoreMark results were omitted (that's also absent in >>>>> v3)? While the >>>> >>>> Of course the benchmark suite should be UnixBench instead of >>>> CoreMark. Lesson: don't multi-task code reviews, especially not >>>> after consuming beer -- a cup of coffee won't fully cancel the >>>> influence. ;-) >>>> >>> Where is rule about benchmark choices like UnixBench/Coremark for ipi >>> improvement? >> >> Sorry for the late reply. The rules are mostly unwritten, but in >> general you can think of the preference of benchmark suites as a >> matter of "effectiveness" -- the closer it's to some real workload in >> the wild, the better. Micro-benchmarks is okay for illustrating the >> points, but without demonstrating the impact on realistic workloads, a >> change could be "useless" in practice or even decrease various >> performance metrics (be that throughput or latency or anything that >> matters in the certain case), but get accepted without notice. > yes, micro-benchmark cannot represent the real world, however it does > not mean that UnixBench/Coremark should be run. You need to point out > what is the negative effective from code, or what is the possible real > scenario which may benefit. And points out the reasonable benchmark > sensitive for IPIs rather than blindly saying UnixBench/Coremark. I was not meaning to argue with you, nor was I implying that your changes "must be regressing things even though I didn't check myself" -- my point is, *any* comparison with realistic workload that shows the performance mostly unaffected inside/outside KVM, would give reviewers (and yourself too) much more confidence in accepting the change. For me, personally I think a microbenchmark could be enough, because the only externally-visible change is the IPI mechanism overhead, but please consider other reviewers that may or may not be familiar enough with LoongArch to be able to notice the "triviality". Also, given the 6-patch size of the series, it could hardly be considered "trivial".