Message ID | 20221021174116.7200-1-logang@deltatee.com |
---|---|
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:4242:0:0:0:0:0 with SMTP id s2csp825512wrr; Fri, 21 Oct 2022 10:44:55 -0700 (PDT) X-Google-Smtp-Source: AMsMyM7WRKGVIFmq09RlJNCXRdAwM77dm8zljAEFf7pS6HUTgYw5d75oPv/F9vy7RLSwQmdDKYtF X-Received: by 2002:a05:6a00:88f:b0:558:8186:3ec3 with SMTP id q15-20020a056a00088f00b0055881863ec3mr19750456pfj.83.1666374284204; Fri, 21 Oct 2022 10:44:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666374284; cv=none; d=google.com; s=arc-20160816; b=pKeukvkGREobfu2+uPADoHF/aBCFbaNaZBP8JOvT3sSn8MLF2Z+To+QEhsZrHTOHMz 4R38Y8zyKRkqsLPTyfaoA7txvxignysuSSmtYQUaKAkHvhOXIAStge3O6kGBuJQjjjiI ScDPbAjy4ze6gJMrUu2Xe1jP0S8mbbVhC0X4bHdCkHlXAprIu3X3e1hjDfL5JK5N+IZr krdYA0e87viE59FGe1wWIbVw7bERyyPObQe5A6kOR0C4sMGlqAZeQ1Usw3DSZ9scIXll Q/MomrdOHQYpk7CCIzewC4BhOBvfNNr12ReekEwy92+xug2ikUd7V9yX1Ggw31E+ndkX ORZQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:subject:content-transfer-encoding:mime-version :message-id:date:cc:to:from:dkim-signature; bh=nF0SVUXmUm6/1jQhAiGtVkXjqXLugkxBCw1H6lsq9qw=; b=bvFsL/XwxTo/2/RQA/IxK6P1zgwBsPmlftFDqCSpxhGFqja/mj4sKfUiZ6iyNCury8 qG5kPMFbkcjwHyBydFX/dYAw5NkcNjirFckS2ye+47/xCFRowVFLverL0m4eO4AVJdmA 2TIi1un+oFeDZSSFQ7qq8qj96xl79EUPieSKARmUyib52ynI53ke2n5i00Bv10sFLwoE ASdGaAnUD7a3tJahwnRfzgSM5iBjLvOxeVS7ztYZxWwU4AiAItvSfh3Rf+wmUpcXSUUA uG4b0XBoio28CQkgCLLngNstHzvOLFi7RCoeldkSgz/5d1/PlVdYfFU9lL4u+uia24lA o34A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@deltatee.com header.s=20200525 header.b=kAAkNydF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=deltatee.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id r12-20020a17090a1bcc00b00203bf024e7asi8852674pjr.7.2022.10.21.10.44.31; Fri, 21 Oct 2022 10:44:44 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@deltatee.com header.s=20200525 header.b=kAAkNydF; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=deltatee.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229998AbiJURlz (ORCPT <rfc822;mntrajkot1@gmail.com> + 99 others); Fri, 21 Oct 2022 13:41:55 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59860 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229918AbiJURla (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Fri, 21 Oct 2022 13:41:30 -0400 Received: from ale.deltatee.com (ale.deltatee.com [204.191.154.188]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F414324AE3C; Fri, 21 Oct 2022 10:41:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=deltatee.com; s=20200525; h=Subject:MIME-Version:Message-Id:Date:Cc:To:From :references:content-disposition:in-reply-to; bh=nF0SVUXmUm6/1jQhAiGtVkXjqXLugkxBCw1H6lsq9qw=; b=kAAkNydFPlQ84zWbqcIf6HYHWw swriGADMx3GagNeM2QyZOZagVVUqkxGVMUP+thgek92nyZqj/nN+tAEQK++PGuuoctMWY7q1X57ex YZ2My7wYnjj8Zk0+4XR1miPNQHBm/khken5A0/4BCVp3HT02+NvGZbh5Q3SXn1r3GqjvJFNhXrphm 28Gro+M5SHbcfCJ4skRzMSah8NQ1cVbBRqLsjR/6Jp3gMKUCuIfTtFnZyWnLBrek+WNqKAZsghIX7 NX/kmablgeZJ2HhcxDrHbzJHJsfxJl5MKmQDtTB2kmrQWMJfF4EmYhAKFHZnzohfJy1BnP16QVC/c WXH1a7vA==; Received: from cgy1-donard.priv.deltatee.com ([172.16.1.31]) by ale.deltatee.com with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from <gunthorp@deltatee.com>) id 1olw1I-00DoHx-T9; Fri, 21 Oct 2022 11:41:22 -0600 Received: from gunthorp by cgy1-donard.priv.deltatee.com with local (Exim 4.94.2) (envelope-from <gunthorp@deltatee.com>) id 1olw1F-0001t4-HC; Fri, 21 Oct 2022 11:41:17 -0600 From: Logan Gunthorpe <logang@deltatee.com> To: linux-kernel@vger.kernel.org, linux-nvme@lists.infradead.org, linux-block@vger.kernel.org, linux-pci@vger.kernel.org, linux-mm@kvack.org Cc: Christoph Hellwig <hch@lst.de>, Greg Kroah-Hartman <gregkh@linuxfoundation.org>, Dan Williams <dan.j.williams@intel.com>, Jason Gunthorpe <jgg@ziepe.ca>, =?utf-8?q?Christian_K=C3=B6nig?= <christian.koenig@amd.com>, John Hubbard <jhubbard@nvidia.com>, Don Dutile <ddutile@redhat.com>, Matthew Wilcox <willy@infradead.org>, Daniel Vetter <daniel.vetter@ffwll.ch>, Minturn Dave B <dave.b.minturn@intel.com>, Jason Ekstrand <jason@jlekstrand.net>, Dave Hansen <dave.hansen@linux.intel.com>, Xiong Jianxin <jianxin.xiong@intel.com>, Bjorn Helgaas <helgaas@kernel.org>, Ira Weiny <ira.weiny@intel.com>, Robin Murphy <robin.murphy@arm.com>, Martin Oliveira <martin.oliveira@eideticom.com>, Chaitanya Kulkarni <ckulkarnilinux@gmail.com>, Ralph Campbell <rcampbell@nvidia.com>, Stephen Bates <sbates@raithlin.com>, Logan Gunthorpe <logang@deltatee.com> Date: Fri, 21 Oct 2022 11:41:07 -0600 Message-Id: <20221021174116.7200-1-logang@deltatee.com> X-Mailer: git-send-email 2.30.2 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-SA-Exim-Connect-IP: 172.16.1.31 X-SA-Exim-Rcpt-To: linux-nvme@lists.infradead.org, linux-kernel@vger.kernel.org, linux-block@vger.kernel.org, linux-pci@vger.kernel.org, linux-mm@kvack.org, hch@lst.de, gregkh@linuxfoundation.org, jgg@ziepe.ca, christian.koenig@amd.com, ddutile@redhat.com, willy@infradead.org, daniel.vetter@ffwll.ch, jason@jlekstrand.net, dave.hansen@linux.intel.com, helgaas@kernel.org, dan.j.williams@intel.com, dave.b.minturn@intel.com, jianxin.xiong@intel.com, ira.weiny@intel.com, robin.murphy@arm.com, martin.oliveira@eideticom.com, ckulkarnilinux@gmail.com, jhubbard@nvidia.com, rcampbell@nvidia.com, sbates@raithlin.com, logang@deltatee.com X-SA-Exim-Mail-From: gunthorp@deltatee.com X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net X-Spam-Level: X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,SPF_HELO_PASS,SPF_PASS, URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 Subject: [PATCH v11 0/9] Userspace P2PDMA with O_DIRECT NVMe devices X-SA-Exim-Version: 4.2.1 (built Sat, 13 Feb 2021 17:57:42 +0000) X-SA-Exim-Scanned: Yes (on ale.deltatee.com) Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1747320081487230327?= X-GMAIL-MSGID: =?utf-8?q?1747320081487230327?= |
Series |
Userspace P2PDMA with O_DIRECT NVMe devices
|
|
Message
Logan Gunthorpe
Oct. 21, 2022, 5:41 p.m. UTC
Hi, This is the latest P2PDMA userspace patch set. This version includes some cleanup from feedback from the last posting[1]. This patch set enables userspace P2PDMA by allowing userspace to mmap() allocated chunks of the CMB. The resulting VMA can be passed only to O_DIRECT IO on NVMe backed files or block devices. A flag is added to GUP() in Patch 1, then Patches 2 through 6 wire this flag up based on whether the block queue indicates P2PDMA support. Patches 7 creates the sysfs resource that can hand out the VMAs and Patch 8 adds brief documentation for the new interface. Feedback welcome. This series is based on v6.1-rc1. A git branch is available here: https://github.com/sbates130272/linux-p2pmem/ p2pdma_user_cmb_v11 Thanks, Logan [1] https://lkml.kernel.org/r/20220922163926.7077-1-logang@deltatee.com -- Changes in v11: - Rebased onto v6.1-rc1, fixed minor conflict in bio_map_user_iov - The GUP test was moved to try_grab_page() and try_grab_folio(). This ought to be a bit more future proof. It required adding a new cleanup patch to return a proper error code from try_grab_page(). (Per Jason) Changes in v10: - Rebased onto v6.0-rc6 - Reworked iov iter changes to reuse the code better and name them without the _flags() prefix (per Christoph) - Renamed a number of flags variables to gup_flags (per John) - Minor fixups to the last documentation patch (from Greg and John) Changes in v9: - Rebased onto v6.0-rc2, included reworking the iov_iter patch due to changes there - Drop the char device mmap implementation in favour of a sysfs based interface. (per Christoph) (v8 only included the first half of the series and was merged for v6.0) Changes in v8: - Rebase onto v5.19-rc1 - Rework how the pages are stored in the VMA per Jason's suggestion Changes in v7: - Rebased onto v5.18-rc1 which includes Christophs cleanup to free_zone_device_page() (similar to Ralph's patch). - Fix bug with concurrent first calls to pci_p2pdma_vma_fault() that caused a double allocation and lost p2p memory. Noticed by Andrew Maier. - Collected a Reviewed-by tag from Chaitanya. - Numerous minor fixes to commit messages -- Logan Gunthorpe (9): mm: allow multiple error returns in try_grab_page() mm: introduce FOLL_PCI_P2PDMA to gate getting PCI P2PDMA pages iov_iter: introduce iov_iter_get_pages_[alloc_]flags() block: add check when merging zone device pages lib/scatterlist: add check when merging zone device pages block: set FOLL_PCI_P2PDMA in __bio_iov_iter_get_pages() block: set FOLL_PCI_P2PDMA in bio_map_user_iov() PCI/P2PDMA: Allow userspace VMA allocations through sysfs ABI: sysfs-bus-pci: add documentation for p2pmem allocate Documentation/ABI/testing/sysfs-bus-pci | 10 ++ block/bio.c | 11 ++- block/blk-map.c | 12 ++- drivers/pci/p2pdma.c | 124 ++++++++++++++++++++++++ include/linux/mm.h | 3 +- include/linux/mmzone.h | 24 +++++ include/linux/uio.h | 6 ++ lib/iov_iter.c | 32 ++++-- lib/scatterlist.c | 25 +++-- mm/gup.c | 45 ++++++--- mm/huge_memory.c | 19 ++-- mm/hugetlb.c | 23 +++-- 12 files changed, 280 insertions(+), 54 deletions(-) base-commit: 9abf2313adc1ca1b6180c508c25f22f9395cc780 -- 2.30.2
Comments
The series looks good to me know. How do we want to handle it? I think we need a special branch somewhere (maybe in the block or mm trees?) so that we can base the other iov_iter work from John on it. Also Al has a whole bunch of iov_iter changes that we probably want on the same branch as well, although some of those (READ vs WRITE fixups) look like 6.1 material to me.
On 10/24/22 08:03, Christoph Hellwig wrote: > The series looks good to me know. How do we want to handle it? I think > we need a special branch somewhere (maybe in the block or mm trees?) > so that we can base the other iov_iter work from John on it. Also > Al has a whole bunch of iov_iter changes that we probably want on > the same branch as well, although some of those (READ vs WRITE fixups) > look like 6.1 material to me. > A little earlier, Jens graciously offered [1] to provide a topic branch, such as: for-6.2/block-gup [2] (I've moved the name forward from 6.1 to 6.2, because that discussion was 7 weeks ago.) [1] https://lore.kernel.org/ae675a01-90e6-4af1-6c43-660b3a6c7b72@kernel.dk [2] https://lore.kernel.org/55a2d67f-9a12-9fe6-d73b-8c3f5eb36f31@kernel.dk thanks,
On Mon, Oct 24, 2022 at 12:15:56PM -0700, John Hubbard wrote: > A little earlier, Jens graciously offered [1] to provide a topic branch, > such as: > > for-6.2/block-gup [2] > > (I've moved the name forward from 6.1 to 6.2, because that discussion > was 7 weeks ago.) So what are we going to do with this series? It would be sad to miss the merge window again.
@add Jens On 2022-11-07 23:56, Christoph Hellwig wrote: > On Mon, Oct 24, 2022 at 12:15:56PM -0700, John Hubbard wrote: >> A little earlier, Jens graciously offered [1] to provide a topic branch, >> such as: >> >> for-6.2/block-gup [2] >> >> (I've moved the name forward from 6.1 to 6.2, because that discussion >> was 7 weeks ago.) > > So what are we going to do with this series? It would be sad to miss > the merge window again. I noticed Jens wasn't copied on this series. I've added him. It would be nice to get this in someone's tree soon. Thanks! Logan
On 11/9/22 10:28 AM, Logan Gunthorpe wrote: > @add Jens > > On 2022-11-07 23:56, Christoph Hellwig wrote: >> On Mon, Oct 24, 2022 at 12:15:56PM -0700, John Hubbard wrote: >>> A little earlier, Jens graciously offered [1] to provide a topic branch, >>> such as: >>> >>> for-6.2/block-gup [2] >>> >>> (I've moved the name forward from 6.1 to 6.2, because that discussion >>> was 7 weeks ago.) >> >> So what are we going to do with this series? It would be sad to miss >> the merge window again. > > I noticed Jens wasn't copied on this series. I've added him. It would be > nice to get this in someone's tree soon. I took a look and the series looks fine to me.