From patchwork Tue Sep 12 12:59:30 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Zhang, Tina" X-Patchwork-Id: 13874 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:9ecd:0:b0:3f2:4152:657d with SMTP id t13csp615791vqx; Tue, 12 Sep 2023 11:57:44 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEqqytcLaYQYftYaE3mWH9K303gBthLi8qMK4TwHbFrYu9FmShZ7bT4RzMYfQyIxNGOS9kK X-Received: by 2002:a05:6870:2cf:b0:1c0:1caf:3324 with SMTP id r15-20020a05687002cf00b001c01caf3324mr524274oaf.44.1694545063956; Tue, 12 Sep 2023 11:57:43 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694545063; cv=none; d=google.com; s=arc-20160816; b=t2WP9f++S7AsCIqZBmvFoOCZWHPTQ32bM1edH9HBzvD6ZeckWZrK24k3oevVvM9SHX OZeCDkORUPC5Pe97Bi6VyUH43E8XPl+9JXoVjI2hZVuCft+eWhRsX6Wjvfpn2Gd3i1um 5liJhzeCHkUfhiuwn9h9YFy0dtAVnwGHQ3ZWb1/grA4NL2fWq1N0/n69LLOCNYyN9Q60 hTQPPyfSo3XjVLy9kPj+Dfpv39cprPjvbzK6bLBAfLp/SBxjFnJLP6R5xSghKqaJGZKj BBCDtHJisiVDegAriITwQNyWxdcFiOujSK0pO0w20JKQRshiG3m5IRFOT0k5r1UuxV9A 0tCw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=T8Cc4eTSZ9R5Y9q2ITZQkrhow1k6RVSPaMKFjvRteoQ=; fh=zW1GPeUL3MQGLvvmLmBmaazP2MzqJ1fOEi8He5Dzy3k=; b=hcs+z8IWVWjKdKEEDvPyl5uQDO8W0+3hVZfhmxvvxU1y7DWPbjoKK9sy/8ivav2drT U5EfBDpBhCoI+znFM13Qq5YAYKouheFYfIXG7MNTuEA4YILLZNnXz9lY7DnES+NC7Q0B QTALSXVUNH+aAIZge087Zug5cEs244VnULdGB0k5MoQiGFltwiG5tr49l/L4g6tZ1IqG Jg+eF3Ca8n0VhnrTLP2bSZmEgKVOKdNxJkBaBDhytpTz2bPoL9Lgw7Zv2+D3ajWp2mzd OK3jQV7y2ocYI6PDc/nIT600wX3o4bmL4MuJbBclnkoHR9qWfHig42xMzZldY8764ejO 4Sig== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=HH2UkIq1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.36 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from pete.vger.email (pete.vger.email. [23.128.96.36]) by mx.google.com with ESMTPS id q200-20020a632ad1000000b005775d4973e5si5203477pgq.728.2023.09.12.11.57.39 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 Sep 2023 11:57:43 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.36 as permitted sender) client-ip=23.128.96.36; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=HH2UkIq1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.36 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by pete.vger.email (Postfix) with ESMTP id 5E08081D55E0; Tue, 12 Sep 2023 06:00:09 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at pete.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235239AbjILM7y (ORCPT + 37 others); Tue, 12 Sep 2023 08:59:54 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34100 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229605AbjILM7w (ORCPT ); Tue, 12 Sep 2023 08:59:52 -0400 Received: from mgamail.intel.com (mgamail.intel.com [192.55.52.115]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5AE12E50 for ; Tue, 12 Sep 2023 05:59:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1694523588; x=1726059588; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=imtiLFkDp+qt4Mfxgc764xTzhAS0qgKYmQIrlfbKTgI=; b=HH2UkIq1lqZRC1ocGSSRXJpzaGuzSkBeAS00CxRCNqUYs+vilcKxLVuD TJWL7uTqkaotSqDWz0ge5or4ZWwGjkIUq+o0dd5Al6JeFTsvWVZ5ZDfZF olPQmxtU27tK1H4qLgJFwj+U2X2tKyIFworJu2eY2HES8519M9fTGR3uC Z2kBbX+X6Zw8csDdVPsFvgNn8yayQP5/Y6Exxgf3/3CsaJWbBVBnTuK0u MXrYVDRf83hHZe2g3lwcEZEY9T58kz9m3Z9UxdXrzTE/8m2x2Cbgii3tY FnDYG4FaI7dicZy/OKrIsdfRl+1B0hfNkmmgCgYngyKSunodUqQiRz/9s Q==; X-IronPort-AV: E=McAfee;i="6600,9927,10831"; a="378277292" X-IronPort-AV: E=Sophos;i="6.02,139,1688454000"; d="scan'208";a="378277292" Received: from fmsmga002.fm.intel.com ([10.253.24.26]) by fmsmga103.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 12 Sep 2023 05:59:47 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10831"; a="858799208" X-IronPort-AV: E=Sophos;i="6.02,139,1688454000"; d="scan'208";a="858799208" Received: from tongshi-mobl.ccr.corp.intel.com (HELO tinazhan-desk1.intel.com) ([10.255.28.214]) by fmsmga002-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 12 Sep 2023 05:59:45 -0700 From: Tina Zhang To: Jason Gunthorpe , Kevin Tian , Lu Baolu Cc: Michael Shavit , Vasant Hegde , iommu@lists.linux.dev, linux-kernel@vger.kernel.org, Tina Zhang Subject: [PATCH v4 0/6] Share sva domains with all devices bound to a mm Date: Tue, 12 Sep 2023 20:59:30 +0800 Message-Id: <20230912125936.722348-1-tina.zhang@intel.com> X-Mailer: git-send-email 2.34.1 MIME-Version: 1.0 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (pete.vger.email [0.0.0.0]); Tue, 12 Sep 2023 06:00:09 -0700 (PDT) X-Spam-Status: No, score=-0.9 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on pete.vger.email X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1776859284796365668 X-GMAIL-MSGID: 1776859284796365668 This series is to share sva(shared virtual addressing) domains with all devices bound to one mm. Problem ------- In the current iommu core code, sva domain is allocated per IOMMU group, when device driver is binding a process address space to a device (which is handled in iommu_sva_bind_device()). If one than more device is bound to the same process address space, there must be more than one sva domain instance, with each device having one. In other words, the sva domain doesn't share between those devices bound to the same process address space, and that leads to two problems: 1) device driver has to duplicate sva domains with enqcmd, as those sva domains have the same PASID and are relevant to one virtual address space. This makes the sva domain handling complex in device drivers. 2) IOMMU driver cannot get sufficient info of the IOMMUs that have devices behind them bound to the same virtual address space, when handling mmu_notifier_ops callbacks. As a result, IOMMU IOTLB invalidation is performed per device instead of per IOMMU, and that may lead to superfluous IOTLB invalidation issue, especially in a virtualization environment where all devices may be behind one virtual IOMMU. Solution -------- This patch-set tries to fix those two problems by allowing sharing sva domains with all devices bound to a mm. To achieve this, a new structure pointer is introduced to mm to replace the old PASID field, which can keep the info of PASID as well as the corresponding shared sva domains. Besides, function iommu_sva_bind_device() is updated to ensure a new sva domain can only be allocated when the old ones cannot work for the IOMMU. With these changes, a device driver can expect one sva domain could work for per PASID instance(e.g., enqcmd PASID instance), and therefore may get rid of handling sva domain duplication. Besides, IOMMU driver (e.g., intel vt-d driver) can get sufficient info (e.g., the info of the IOMMUs having their devices bound to one virtual address space) when handling mmu_notifier_ops callbacks, to remove the redundant IOTLB invalidations. Arguably there shouldn't be more than one sva_domain with the same PASID, and in any sane configuration there should be only 1 type of IOMMU driver that needs only 1 SVA domain. However, in reality, IOMMUs on one platform may not be identical to each other. Thus, attaching a sva domain that has been successfully bound to device A behind a IOMMU A, to device B behind IOMMU B may get failed due to the difference between IOMMU A and IOMMU B. In this case, a new sva domain with the same PASID needs to be allocated to work with IOMMU B. That's why we need a list to keep sva domains of one PASID. For the platform where IOMMUs are compatible to each other, there should be one sva domain in the list. v4: - Rebase to v6.6-rc1. v3: - Add a comment describing domain->next. - Expand explanation of why PASID isn't released in iommu_sva_unbind_device(). - Add a patch to remove mm->pasid in intel_sva_bind_mm() v2: - Add mm_get_enqcmd_pasid(). - Update commit message. v1: https://lore.kernel.org/linux-iommu/20230808074944.7825-1-tina.zhang@intel.com/ Tina Zhang (6): iommu: Add mm_get_enqcmd_pasid() helper function iommu/vt-d: Remove mm->pasid in intel_sva_bind_mm() iommu: Introduce mm_get_pasid() helper function mm: Add structure to keep sva information iommu: Support mm PASID 1:n with sva domains mm: Deprecate pasid field arch/x86/kernel/traps.c | 2 +- .../iommu/arm/arm-smmu-v3/arm-smmu-v3-sva.c | 12 ++--- drivers/iommu/intel/svm.c | 14 +++--- drivers/iommu/iommu-sva.c | 45 ++++++++++++------- include/linux/iommu.h | 31 ++++++++++--- include/linux/mm_types.h | 3 +- kernel/fork.c | 1 - mm/init-mm.c | 3 -- 8 files changed, 70 insertions(+), 41 deletions(-)