From patchwork Wed Nov 2 20:34:04 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Oded Gabbay X-Patchwork-Id: 14479 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:6687:0:0:0:0:0 with SMTP id l7csp130537wru; Wed, 2 Nov 2022 13:41:09 -0700 (PDT) X-Google-Smtp-Source: AMsMyM5BOpVjX6SQdtfn4w02Cme/gjJgw/SUdY/xB6urkMGf//DSc5W0JPY65fC+uas4TUAeF6r/ X-Received: by 2002:aa7:c14b:0:b0:461:c47d:48cf with SMTP id r11-20020aa7c14b000000b00461c47d48cfmr25858902edp.83.1667421669227; Wed, 02 Nov 2022 13:41:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1667421669; cv=none; d=google.com; s=arc-20160816; b=xsdKWGkuHE+4HoYSsk3BjU2XpL3M2XAPOLpoIFCQWTLgzSdFUB5qanUWa1klfPqcJo HT1fpRl5osNKmH9vsIjqXeuow2soxEQrDCQIdRpP0O/Os87DI/qF7qNsjujwBvmhWAOd gob4kKD6IvEwCu/R0SJrxjoxJTEYloY8fJOyd+TfNPQWOjLegjYtbPspVhB0J+erIFpN cJc0/Lnn56DFF2YYKihPwGiVc0bPH4MwLDa8Y64dZ42qtykccC80aKBfKHDDMt9yGPqa JUV73ACH45nmHLcqE5JUNM7eI40hUjMutfi7XCZ39P2az3QA690/ilnKBAciPbTS6Hzx NVJA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=3cyTu7LP/c6J33STMZLzvScjUVkJ6ZP/D3KDjrlsE1I=; b=0ZWQaf67EcYQArl2bqMvNZCtHFgHqz+EOViDERJLpwNfO940JEVTct5wk7KEHDbeiJ hMy3J+H+8DWB2YcDCPZPw3L/2LJEitLoanXX7KT9cBjBy8IDUGZlJA+Tb2wXW8JBO5fa ynIxSjdK8EQAa0JKkmKDxyUL/i04oYGUFD6dXeJ4OZknuH1ZtkIbBTnd9aaDgCYeh9nZ qVkkyG5tBhuQYB+ooIfES9G8fwLOmcgUeW/OhBuu3HF5W19Wz+HNof+cWxyMCHulq9r3 tM7dhTMz0OrdQxeD0p1T63sZEOFmEQOYOtUxJ8ElbbRak8VGdJRUoOOBPnVlF7VjmMcD 5i1w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=F+s73xiv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id dz19-20020a0564021d5300b0045c13366de4si18413356edb.572.2022.11.02.13.40.35; Wed, 02 Nov 2022 13:41:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=F+s73xiv; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231246AbiKBUee (ORCPT + 99 others); Wed, 2 Nov 2022 16:34:34 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:59922 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231272AbiKBUe2 (ORCPT ); Wed, 2 Nov 2022 16:34:28 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D20C06403 for ; Wed, 2 Nov 2022 13:34:26 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 6052261BF7 for ; Wed, 2 Nov 2022 20:34:26 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 87A1FC433C1; Wed, 2 Nov 2022 20:34:20 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1667421265; bh=ATXrtyCVY8SWg2oejMlYWzbqx44OoY28tztz/qRQuUE=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=F+s73xivFwNUPpe2t3rohK3K116c2C27UiaLaAYwu/cHlVPkQc23snOR3VRtpOIOi +yb+0rQl3N9c0PT/kCJqZH/qXHTsfvI0rXkL4ByBkvA2I3YVeS4sZLyhQl6prmuP2Y EcxviBKK4lWD7aZ2z/ZPAOlEetIJFB+Ft8My172SgnkaxnvSXL5LLOT1Pvl0HjtRO5 mXFscuNkQMyDCn5IWvAbupf0+DxylxwfzBc2aCYjZeVLBL/mXVLFyQuE3CWv9tqopq 2Hz7mFhJ+1dcxFU7NYSu9IBQH6oOq2AeDL/qBvMBlmWx5Syw1NSoX3mw5GraZzKWKU b5IEWfzDAgDgg== From: Oded Gabbay To: David Airlie , Daniel Vetter , Arnd Bergmann , Greg Kroah-Hartman , linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, Jason Gunthorpe , John Hubbard , Alex Deucher Cc: Maarten Lankhorst , Maxime Ripard , Thomas Zimmermann , Yuji Ishikawa , Jiho Chu , Daniel Stone , Tvrtko Ursulin , Jeffrey Hugo , Christoph Hellwig , Kevin Hilman , Jagan Teki , Jacek Lawrynowicz , Maciej Kwapulinski Subject: [RFC PATCH v2 2/3] accel: add dedicated minor for accelerator devices Date: Wed, 2 Nov 2022 22:34:04 +0200 Message-Id: <20221102203405.1797491-3-ogabbay@kernel.org> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20221102203405.1797491-1-ogabbay@kernel.org> References: <20221102203405.1797491-1-ogabbay@kernel.org> MIME-Version: 1.0 X-Spam-Status: No, score=-8.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1748418343947052579?= X-GMAIL-MSGID: =?utf-8?q?1748418343947052579?= The accelerator devices are exposed to user-space using a dedicated major. In addition, they are represented in /dev with new, dedicated device char names: /dev/accel/accel*. This is done to make sure any user-space software that tries to open a graphic card won't open the accelerator device by mistake. The above implies that the minor numbering should be separated from the rest of the DRM devices. However, to avoid code duplication, we want the drm_minor structure to be able to represent the accelerator device. To achieve this, we add a new drm_minor* to drm_device that represents the accelerator device. This pointer is initialized for drivers that declare they handle compute accelerator, using a new driver feature flag called DRIVER_COMPUTE_ACCEL. It is important to note that this driver feature is mutually exclusive with DRIVER_RENDER. Devices that want to expose both graphics and compute device char files should be handled by two drivers that are connected using the auxiliary bus framework. In addition, we define a different xarray to handle the accelerators minors. This is done to make the minor's index be identical to the device index in /dev/. Any access to the xarray is done solely by functions in accel_drv.c, as the xarray is define as static. The DRM core functions call those functions in case they detect the minor's type is DRM_MINOR_ACCEL. We define a separate accel_open function (from drm_open) that the accel drivers should set as their open callback function. Both these functions eventually call the same drm_open_helper(), which had to be changed to be non-static so it can be called from accel_drv.c. accel_open() partially duplicates drm_open as I removed some code from it that handles legacy devices. Signed-off-by: Oded Gabbay --- Changes in v2: - Moved all accel minor handling code to accel_drv.c - Replaced deprecated idr with xarray drivers/accel/accel_drv.c | 205 +++++++++++++++++++++++++++++++++---- drivers/gpu/drm/drm_file.c | 2 +- include/drm/drm_accel.h | 29 +++++- include/drm/drm_device.h | 3 + include/drm/drm_drv.h | 8 ++ include/drm/drm_file.h | 21 +++- 6 files changed, 247 insertions(+), 21 deletions(-) -- 2.25.1 diff --git a/drivers/accel/accel_drv.c b/drivers/accel/accel_drv.c index 6132765ea054..964a93799936 100644 --- a/drivers/accel/accel_drv.c +++ b/drivers/accel/accel_drv.c @@ -9,13 +9,22 @@ #include #include #include +#include #include +#include +#include #include #include +static DEFINE_XARRAY_ALLOC(accel_minors_xa); + static struct dentry *accel_debugfs_root; -struct class *accel_class; +static struct class *accel_class; + +static struct device_type accel_sysfs_device_minor = { + .name = "accel_minor" +}; static char *accel_devnode(struct device *dev, umode_t *mode) { @@ -24,16 +33,6 @@ static char *accel_devnode(struct device *dev, umode_t *mode) static CLASS_ATTR_STRING(accel_version, 0444, "accel 1.0.0 20221018"); -/** - * accel_sysfs_init - initialize sysfs helpers - * - * This is used to create the ACCEL class, which is the implicit parent of any - * other top-level ACCEL sysfs objects. - * - * You must call accel_sysfs_destroy() to release the allocated resources. - * - * Return: 0 on success, negative error code on failure. - */ static int accel_sysfs_init(void) { int err; @@ -54,11 +53,6 @@ static int accel_sysfs_init(void) return 0; } -/** - * accel_sysfs_destroy - destroys ACCEL class - * - * Destroy the ACCEL device class. - */ static void accel_sysfs_destroy(void) { if (IS_ERR_OR_NULL(accel_class)) @@ -68,11 +62,185 @@ static void accel_sysfs_destroy(void) accel_class = NULL; } +/** + * accel_set_device_instance_params() - Set some device parameters for accel device + * @kdev: Pointer to the device instance. + * @index: The minor's index + * + * This function creates the dev_t of the device using the accel major and + * the device's minor number. In addition, it sets the class and type of the + * device instance to the accel sysfs class and device type, respectively. + */ +void accel_set_device_instance_params(struct device *kdev, int index) +{ + kdev->devt = MKDEV(ACCEL_MAJOR, index); + kdev->class = accel_class; + kdev->type = &accel_sysfs_device_minor; +} + +/** + * accel_minor_alloc() - Allocates a new accel minor + * + * This function access the accel minors xarray and allocates from it + * a new id to represent a new accel minor + * + * Return: A new id on success or error code in case xa_alloc failed + */ +int accel_minor_alloc(void) +{ + int rc, index; + + rc = xa_alloc(&accel_minors_xa, &index, NULL, + XA_LIMIT(0, ACCEL_MAX_MINORS - 1), GFP_KERNEL); + if (rc < 0) + return rc; + + return index; +} + +/** + * accel_minor_remove() - Remove an accel minor + * @index: The minor id to remove. + * + * This function access the accel minors xarray and removes from + * it the member with the id that is passed to this function. + */ +void accel_minor_remove(int index) +{ + xa_erase(&accel_minors_xa, index); +} + +/** + * accel_minor_replace() - Replace minor pointer in accel minors xarray. + * @minor: Pointer to the new minor. + * @index: The minor id to replace. + * + * This function access the accel minors xarray structure and replaces the pointer + * that is associated with an existing id. Because the minor pointer can be + * NULL, we need to explicitly pass the index. + * + * Return: 0 for success, negative value for error + */ +int accel_minor_replace(struct drm_minor *minor, int index) +{ + if (minor) { + void *entry; + + entry = xa_cmpxchg(&accel_minors_xa, index, NULL, minor, GFP_KERNEL); + if (xa_is_err(entry)) + return xa_err(entry); + } else { + xa_store(&accel_minors_xa, index, NULL, GFP_KERNEL); + } + + return 0; +} + +/* + * Looks up the given minor-ID and returns the respective DRM-minor object. The + * refence-count of the underlying device is increased so you must release this + * object with accel_minor_release(). + * + * The object can be only a drm_minor that represents an accel device. + * + * As long as you hold this minor, it is guaranteed that the object and the + * minor->dev pointer will stay valid! However, the device may get unplugged and + * unregistered while you hold the minor. + */ +static struct drm_minor *accel_minor_acquire(unsigned int minor_id) +{ + struct drm_minor *minor; + + xa_lock(&accel_minors_xa); + minor = xa_load(&accel_minors_xa, minor_id); + if (minor) + drm_dev_get(minor->dev); + xa_unlock(&accel_minors_xa); + + if (!minor) { + return ERR_PTR(-ENODEV); + } else if (drm_dev_is_unplugged(minor->dev)) { + drm_dev_put(minor->dev); + return ERR_PTR(-ENODEV); + } + + return minor; +} + +static void accel_minor_release(struct drm_minor *minor) +{ + drm_dev_put(minor->dev); +} + +/** + * accel_open - open method for ACCEL file + * @inode: device inode + * @filp: file pointer. + * + * This function must be used by drivers as their &file_operations.open method. + * It looks up the correct ACCEL device and instantiates all the per-file + * resources for it. It also calls the &drm_driver.open driver callback. + * + * Return: 0 on success or negative errno value on failure. + */ +int accel_open(struct inode *inode, struct file *filp) +{ + struct drm_device *dev; + struct drm_minor *minor; + int retcode; + + minor = accel_minor_acquire(iminor(inode)); + if (IS_ERR(minor)) + return PTR_ERR(minor); + + dev = minor->dev; + + atomic_fetch_inc(&dev->open_count); + + /* share address_space across all char-devs of a single device */ + filp->f_mapping = dev->anon_inode->i_mapping; + + retcode = drm_open_helper(filp, minor); + if (retcode) + goto err_undo; + + return 0; + +err_undo: + atomic_dec(&dev->open_count); + accel_minor_release(minor); + return retcode; +} +EXPORT_SYMBOL_GPL(accel_open); + static int accel_stub_open(struct inode *inode, struct file *filp) { - DRM_DEBUG("Operation not supported"); + const struct file_operations *new_fops; + struct drm_minor *minor; + int err; + + DRM_DEBUG("\n"); + + minor = accel_minor_acquire(iminor(inode)); + if (IS_ERR(minor)) + return PTR_ERR(minor); + + new_fops = fops_get(minor->dev->driver->fops); + if (!new_fops) { + err = -ENODEV; + goto out; + } + + replace_fops(filp, new_fops); + if (filp->f_op->open) + err = filp->f_op->open(inode, filp); + else + err = 0; + +out: + accel_minor_release(minor); - return -EOPNOTSUPP; + return err; } static const struct file_operations accel_stub_fops = { @@ -86,6 +254,7 @@ void accel_core_exit(void) unregister_chrdev(ACCEL_MAJOR, "accel"); debugfs_remove(accel_debugfs_root); accel_sysfs_destroy(); + WARN_ON(!xa_empty(&accel_minors_xa)); } int __init accel_core_init(void) diff --git a/drivers/gpu/drm/drm_file.c b/drivers/gpu/drm/drm_file.c index a8b4d918e9a3..64b4a3a87fbb 100644 --- a/drivers/gpu/drm/drm_file.c +++ b/drivers/gpu/drm/drm_file.c @@ -326,7 +326,7 @@ static int drm_cpu_valid(void) * Creates and initializes a drm_file structure for the file private data in \p * filp and add it into the double linked list in \p dev. */ -static int drm_open_helper(struct file *filp, struct drm_minor *minor) +int drm_open_helper(struct file *filp, struct drm_minor *minor) { struct drm_device *dev = minor->dev; struct drm_file *priv; diff --git a/include/drm/drm_accel.h b/include/drm/drm_accel.h index cf43a7b30f34..0c0ae387d075 100644 --- a/include/drm/drm_accel.h +++ b/include/drm/drm_accel.h @@ -8,12 +8,20 @@ #ifndef DRM_ACCEL_H_ #define DRM_ACCEL_H_ -#define ACCEL_MAJOR 261 +#include + +#define ACCEL_MAJOR 261 +#define ACCEL_MAX_MINORS 256 #if IS_ENABLED(CONFIG_ACCEL) void accel_core_exit(void); int accel_core_init(void); +void accel_minor_remove(int index); +int accel_minor_alloc(void); +int accel_minor_replace(struct drm_minor *minor, int index); +void accel_set_device_instance_params(struct device *kdev, int index); +int accel_open(struct inode *inode, struct file *filp); #else @@ -23,9 +31,28 @@ static inline void accel_core_exit(void) static inline int __init accel_core_init(void) { + /* Return 0 to allow drm_core_init to complete successfully */ return 0; } +static inline void accel_minor_remove(int index) +{ +} + +static inline int accel_minor_alloc(void) +{ + return -EOPNOTSUPP; +} + +static inline int accel_minor_replace(struct drm_minor *minor, int index) +{ + return -EOPNOTSUPP; +} + +static inline void accel_set_device_instance_params(struct device *kdev, int index) +{ +} + #endif /* IS_ENABLED(CONFIG_ACCEL) */ #endif /* DRM_ACCEL_H_ */ diff --git a/include/drm/drm_device.h b/include/drm/drm_device.h index 9923c7a6885e..933ce2048e20 100644 --- a/include/drm/drm_device.h +++ b/include/drm/drm_device.h @@ -93,6 +93,9 @@ struct drm_device { /** @render: Render node */ struct drm_minor *render; + /** @accel: Compute Acceleration node */ + struct drm_minor *accel; + /** * @registered: * diff --git a/include/drm/drm_drv.h b/include/drm/drm_drv.h index f6159acb8856..706e68ca5116 100644 --- a/include/drm/drm_drv.h +++ b/include/drm/drm_drv.h @@ -94,6 +94,14 @@ enum drm_driver_feature { * synchronization of command submission. */ DRIVER_SYNCOBJ_TIMELINE = BIT(6), + /** + * @DRIVER_COMPUTE_ACCEL: + * + * Driver supports compute acceleration devices. This flag is mutually exclusive with + * @DRIVER_RENDER and @DRIVER_MODESET. Devices that support both graphics and compute + * acceleration should be handled by two drivers that are connected using auxiliry bus. + */ + DRIVER_COMPUTE_ACCEL = BIT(7), /* IMPORTANT: Below are all the legacy flags, add new ones above. */ diff --git a/include/drm/drm_file.h b/include/drm/drm_file.h index d780fd151789..0d1f853092ab 100644 --- a/include/drm/drm_file.h +++ b/include/drm/drm_file.h @@ -51,11 +51,15 @@ struct file; /* Note that the order of this enum is ABI (it determines * /dev/dri/renderD* numbers). + * + * Setting DRM_MINOR_ACCEL to 32 gives enough space for more drm minors to + * be implemented before we hit any future */ enum drm_minor_type { DRM_MINOR_PRIMARY, DRM_MINOR_CONTROL, DRM_MINOR_RENDER, + DRM_MINOR_ACCEL = 32, }; /** @@ -70,7 +74,7 @@ enum drm_minor_type { struct drm_minor { /* private: */ int index; /* Minor device number */ - int type; /* Control or render */ + int type; /* Control or render or accel */ struct device *kdev; /* Linux device */ struct drm_device *dev; @@ -397,7 +401,22 @@ static inline bool drm_is_render_client(const struct drm_file *file_priv) return file_priv->minor->type == DRM_MINOR_RENDER; } +/** + * drm_is_accel_client - is this an open file of the compute acceleration node + * @file_priv: DRM file + * + * Returns true if this is an open file of the compute acceleration node, i.e. + * &drm_file.minor of @file_priv is a accel minor. + * + * See also the :ref:`section on accel nodes `. + */ +static inline bool drm_is_accel_client(const struct drm_file *file_priv) +{ + return file_priv->minor->type == DRM_MINOR_ACCEL; +} + int drm_open(struct inode *inode, struct file *filp); +int drm_open_helper(struct file *filp, struct drm_minor *minor); ssize_t drm_read(struct file *filp, char __user *buffer, size_t count, loff_t *offset); int drm_release(struct inode *inode, struct file *filp);