From: André Almeida
To: dri-devel@lists.freedesktop.org, amd-gfx@lists.freedesktop.org,
    linux-kernel@vger.kernel.org
Cc: kernel-dev@igalia.com, alexander.deucher@amd.com,
    contactshashanksharma@gmail.com, amaranath.somalapuram@amd.com,
    christian.koenig@amd.com, pierre-eric.pelloux-prayer@amd.com,
    "Simon Ser", "Rob Clark", "Pekka Paalanen", "Daniel Vetter",
    "Daniel Stone", Marek Olšák, "Dave Airlie", André Almeida
Subject: [PATCH v2 0/1] drm: Add doc about GPU reset
Date: Mon, 27 Feb 2023 15:39:59 -0500
Message-Id: <20230227204000.56787-1-andrealmeid@igalia.com>

Hi,

Thanks to everyone who gave feedback.

v2 changes:
- This new version is a section of drm-uapi instead of a new file
- Drop the requirement for the KMD to kill applications
- Drop the role of init systems in compositor recovery
- Drop the assumption that robust apps create new contexts

Original cover letter below:

Due to the complexity of the graphics stack and the apps that we run on
it, GPU resets are to be expected. What's left for driver developers is
how to make resets as smooth an experience as possible.
While some OSes can recover or show an error message in such cases,
Linux is more hit-and-miss due to its lack of standardization and of
guidelines on what to do. The goal of this document is to properly
define what should happen after a GPU reset so that developers can
start building on top of it. An IGT test should be created to validate
this for each driver.

Initially my approach was to expose a uevent for GPU resets, as can be
seen here [1]. However, even if a uevent can be useful for some use
cases (e.g. telemetry and error reporting), for the "OS integration"
case of GPU resets it would be more productive to have something
defined throughout the stack.

Thanks,
André

[1] https://lore.kernel.org/amd-gfx/20221125175203.52481-1-andrealmeid@igalia.com/

André Almeida (1):
  drm/doc: Document DRM device reset expectations

 Documentation/gpu/drm-uapi.rst | 51 ++++++++++++++++++++++++++++++++++
 1 file changed, 51 insertions(+)