Message ID | 20230310074000.2078124-1-lizhenneng@kylinos.cn |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:5915:0:0:0:0:0 with SMTP id v21csp735504wrd; Thu, 9 Mar 2023 23:42:48 -0800 (PST) X-Google-Smtp-Source: AK7set+ZLpR4skivvvAB+77Eh7zFGoLUIWp5udu1Ke2vF5Gzzbf/XK5KrdU29d5CkeWVZyqd5UeK X-Received: by 2002:a17:903:24d:b0:19e:76b7:c7d2 with SMTP id j13-20020a170903024d00b0019e76b7c7d2mr28967497plh.26.1678434167852; Thu, 09 Mar 2023 23:42:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1678434167; cv=none; d=google.com; s=arc-20160816; b=RaFEQ7jqXFHtDqVh0Imm/XgwzVlOxfHSX0VYQoFkEgFXOXkTafthruXQw6HUwpucO0 HCfsILSlopBt1w5K7op9n75c0yBa1SulpWGgrfvv+aAi02i0LnEp38jpQpZnC0N10oZN BUEyGzjL0KIh3y0ryXImFqjCMyjuaKo6OMphQDbJDdejUqVA6AOJxdTl87gU0Od8s60J sUyZE4Z5CY5ulAhTpZQLfPU20fN7fyJgW4gSV+5rYmChPbvYbwRJ+ZEbJ/QC0ThHAifh AQQTCVLR35YewREcHIkvCe5tKC6BshabMHqlG6cVm/h3hYIsaGbRCk1QyaFosXgWo5bi nIFg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=dzUPv2nS8rBf3LkcegPaTZw2LxdQjbAreWOz/o+EvbI=; b=PNtyAzz6v+LknAKG7GVKUEsnSB6ZpJ1bCcnqd3jmY4ntqMQRQvn3CxJ0LqqDM5miiN F7D6affFO+4kHn9c5zv0sperWIxZUGv6gXxoY3M/mzQENk7hOT4jUoqIsZdlcVwmV/PC SQ7Y83A/CCPbYi0BYsU7d4bpmHw6Sx022T3TgkwWvzKDT3SGmfoG+UPfh1qAyOhbh5Cy ob8GTn5Jf1g92ZEUmTWqrf6eWqKqECszbDQn4i0CjSn/rMtGNnBj/PGKQEwySJqO4pTl HKU6Yrsg6xj6H+EP5tmRNDmtykr3JGTwVjFoflqbUInAhSdAa7a/HiBHSb9jGY2MrQ9E CkTA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id kf12-20020a17090305cc00b0019ace4c5bd5si1375352plb.513.2023.03.09.23.42.32; Thu, 09 Mar 2023 23:42:47 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230321AbjCJHlV (ORCPT <rfc822;carlos.wei.hk@gmail.com> + 99 others); Fri, 10 Mar 2023 02:41:21 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34282 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230332AbjCJHkY (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Fri, 10 Mar 2023 02:40:24 -0500 Received: from mailgw.kylinos.cn (mailgw.kylinos.cn [124.126.103.232]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 062ACF8647 for <linux-kernel@vger.kernel.org>; Thu, 9 Mar 2023 23:40:11 -0800 (PST) X-UUID: bfaf3b3b4df0410c8d7f35f284529c16-20230310 X-CPASD-INFO: 9d00fa27f33d41c298fadc8022794e1a@grRqUJOTYGeSU3Wwg3SubFlpmo2Esnp qUmJlaWCFhrOMbFJkYl1SgIFqVmprYmBZUHt1bFZgYGJdUXh3lHKPVGBeYIJUdJOAe55QYZSR X-CLOUD-ID: 9d00fa27f33d41c298fadc8022794e1a X-CPASD-SUMMARY: SIP:-1,APTIP:-2.0,KEY:0.0,FROMBLOCK:1,OB:0.0,URL:-5,TVAL:196. 0,ESV:0.0,ECOM:-5.0,ML:14.0,FD:0.0,CUTS:112.0,IP:-2.0,MAL:-5.0,PHF:-5.0,PHC:- 5.0,SPF:4.0,EDMS:-5,IPLABEL:4480.0,FROMTO:0,AD:0,FFOB:0.0,CFOB:1.0,SPC:0,SIG: -5,AUF:7,DUF:17729,ACD:257,DCD:257,SL:0,EISP:0,AG:0,CFC:0.319,CFSR:0.082,UAT: 0,RAF:0,IMG:-5.0,DFA:0,DTA:0,IBL:-2.0,ADI:-5,SBL:0,REDM:0,REIP:0,ESB:0,ATTNUM :0,EAF:0,CID:-5.0,VERSION:2.3.17 X-CPASD-ID: bfaf3b3b4df0410c8d7f35f284529c16-20230310 X-CPASD-BLOCK: 14 X-CPASD-STAGE: 1 X-UUID: bfaf3b3b4df0410c8d7f35f284529c16-20230310 X-User: lizhenneng@kylinos.cn Received: from localhost.localdomain [(116.128.244.169)] by mailgw (envelope-from <lizhenneng@kylinos.cn>) (Generic MTA) with ESMTP id 1495147978; Fri, 10 Mar 2023 15:40:09 +0800 From: Zhenneng Li <lizhenneng@kylinos.cn> To: Alex Deucher <alexander.deucher@amd.com> Cc: =?utf-8?q?Christian_K=C3=B6nig?= <christian.koenig@amd.com>, Pan Xinhui <Xinhui.Pan@amd.com>, David Airlie <airlied@linux.ie>, Daniel Vetter <daniel@ffwll.ch>, amd-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org, linux-kernel@vger.kernel.org, Zhenneng Li <lizhenneng@kylinos.cn> Subject: [PATCH] drm/amdgpu: resove reboot exception for si oland Date: Fri, 10 Mar 2023 15:39:59 +0800 Message-Id: <20230310074000.2078124-1-lizhenneng@kylinos.cn> X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_NONE, SPF_PASS,UNPARSEABLE_RELAY autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1759965786298511005?= X-GMAIL-MSGID: =?utf-8?q?1759965786298511005?= |
Series |
drm/amdgpu: resove reboot exception for si oland
|
|
Commit Message
李真能
March 10, 2023, 7:39 a.m. UTC
During reboot test on arm64 platform, it may failure on boot. The error message are as follows: [ 6.996395][ 7] [ T295] [drm:amdgpu_device_ip_late_init [amdgpu]] *ERROR* late_init of IP block <si_dpm> failed -22 [ 7.006919][ 7] [ T295] amdgpu 0000:04:00.0: amdgpu_device_ip_late_init failed [ 7.014224][ 7] [ T295] amdgpu 0000:04:00.0: Fatal error during GPU init --- drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c | 3 --- 1 file changed, 3 deletions(-)
Comments
> -----Original Message----- > From: amd-gfx <amd-gfx-bounces@lists.freedesktop.org> On Behalf Of > Zhenneng Li > Sent: Friday, March 10, 2023 3:40 PM > To: Deucher, Alexander <Alexander.Deucher@amd.com> > Cc: David Airlie <airlied@linux.ie>; Pan, Xinhui <Xinhui.Pan@amd.com>; > linux-kernel@vger.kernel.org; dri-devel@lists.freedesktop.org; Zhenneng Li > <lizhenneng@kylinos.cn>; amd-gfx@lists.freedesktop.org; Daniel Vetter > <daniel@ffwll.ch>; Koenig, Christian <Christian.Koenig@amd.com> > Subject: [PATCH] drm/amdgpu: resove reboot exception for si oland > > During reboot test on arm64 platform, it may failure on boot. > > The error message are as follows: > [ 6.996395][ 7] [ T295] [drm:amdgpu_device_ip_late_init [amdgpu]] > *ERROR* > late_init of IP block <si_dpm> failed -22 > [ 7.006919][ 7] [ T295] amdgpu 0000:04:00.0: amdgpu_device_ip_late_init > failed > [ 7.014224][ 7] [ T295] amdgpu 0000:04:00.0: Fatal error during GPU init > --- > drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c | 3 --- > 1 file changed, 3 deletions(-) > > diff --git a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > index d6d9e3b1b2c0..dee51c757ac0 100644 > --- a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > +++ b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > @@ -7632,9 +7632,6 @@ static int si_dpm_late_init(void *handle) > if (!adev->pm.dpm_enabled) > return 0; > > - ret = si_set_temperature_range(adev); > - if (ret) > - return ret; si_set_temperature_range should be platform agnostic. Can you please elaborate more? Regards, Guchun > #if 0 //TODO ? > si_dpm_powergate_uvd(adev, true); > #endif > -- > 2.25.1
On Fri, Mar 10, 2023 at 3:18 AM Chen, Guchun <Guchun.Chen@amd.com> wrote: > > > > -----Original Message----- > > From: amd-gfx <amd-gfx-bounces@lists.freedesktop.org> On Behalf Of > > Zhenneng Li > > Sent: Friday, March 10, 2023 3:40 PM > > To: Deucher, Alexander <Alexander.Deucher@amd.com> > > Cc: David Airlie <airlied@linux.ie>; Pan, Xinhui <Xinhui.Pan@amd.com>; > > linux-kernel@vger.kernel.org; dri-devel@lists.freedesktop.org; Zhenneng Li > > <lizhenneng@kylinos.cn>; amd-gfx@lists.freedesktop.org; Daniel Vetter > > <daniel@ffwll.ch>; Koenig, Christian <Christian.Koenig@amd.com> > > Subject: [PATCH] drm/amdgpu: resove reboot exception for si oland > > > > During reboot test on arm64 platform, it may failure on boot. > > > > The error message are as follows: > > [ 6.996395][ 7] [ T295] [drm:amdgpu_device_ip_late_init [amdgpu]] > > *ERROR* > > late_init of IP block <si_dpm> failed -22 > > [ 7.006919][ 7] [ T295] amdgpu 0000:04:00.0: amdgpu_device_ip_late_init > > failed > > [ 7.014224][ 7] [ T295] amdgpu 0000:04:00.0: Fatal error during GPU init > > --- > > drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c | 3 --- > > 1 file changed, 3 deletions(-) > > > > diff --git a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > > b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > > index d6d9e3b1b2c0..dee51c757ac0 100644 > > --- a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > > +++ b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c > > @@ -7632,9 +7632,6 @@ static int si_dpm_late_init(void *handle) > > if (!adev->pm.dpm_enabled) > > return 0; > > > > - ret = si_set_temperature_range(adev); > > - if (ret) > > - return ret; > > si_set_temperature_range should be platform agnostic. Can you please elaborate more? > Yes. Not setting this means we won't get thermal interrupts. We shouldn't skip this. Alex > Regards, > Guchun > > > #if 0 //TODO ? > > si_dpm_powergate_uvd(adev, true); > > #endif > > -- > > 2.25.1 >
This bug is first reported here: https://lore.kernel.org/lkml/1a620e7c-5b71-3d16-001a-0d79b292aca7@amd.com/ I modify the patch accroding mail list's discusstion, and I do reboot test for tens of thousands of times about 10 machines on arm64, there's no bug reported. 在 2023/3/10 16:18, Chen, Guchun 写道: >> -----Original Message----- >> From: amd-gfx <amd-gfx-bounces@lists.freedesktop.org> On Behalf Of >> Zhenneng Li >> Sent: Friday, March 10, 2023 3:40 PM >> To: Deucher, Alexander <Alexander.Deucher@amd.com> >> Cc: David Airlie <airlied@linux.ie>; Pan, Xinhui <Xinhui.Pan@amd.com>; >> linux-kernel@vger.kernel.org; dri-devel@lists.freedesktop.org; Zhenneng Li >> <lizhenneng@kylinos.cn>; amd-gfx@lists.freedesktop.org; Daniel Vetter >> <daniel@ffwll.ch>; Koenig, Christian <Christian.Koenig@amd.com> >> Subject: [PATCH] drm/amdgpu: resove reboot exception for si oland >> >> During reboot test on arm64 platform, it may failure on boot. >> >> The error message are as follows: >> [ 6.996395][ 7] [ T295] [drm:amdgpu_device_ip_late_init [amdgpu]] >> *ERROR* >> late_init of IP block <si_dpm> failed -22 >> [ 7.006919][ 7] [ T295] amdgpu 0000:04:00.0: amdgpu_device_ip_late_init >> failed >> [ 7.014224][ 7] [ T295] amdgpu 0000:04:00.0: Fatal error during GPU init >> --- >> drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c | 3 --- >> 1 file changed, 3 deletions(-) >> >> diff --git a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c >> b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c >> index d6d9e3b1b2c0..dee51c757ac0 100644 >> --- a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c >> +++ b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c >> @@ -7632,9 +7632,6 @@ static int si_dpm_late_init(void *handle) >> if (!adev->pm.dpm_enabled) >> return 0; >> >> - ret = si_set_temperature_range(adev); >> - if (ret) >> - return ret; > si_set_temperature_range should be platform agnostic. Can you please elaborate more? > > Regards, > Guchun > >> #if 0 //TODO ? >> si_dpm_powergate_uvd(adev, true); >> #endif >> -- >> 2.25.1
diff --git a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c index d6d9e3b1b2c0..dee51c757ac0 100644 --- a/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c +++ b/drivers/gpu/drm/amd/pm/legacy-dpm/si_dpm.c @@ -7632,9 +7632,6 @@ static int si_dpm_late_init(void *handle) if (!adev->pm.dpm_enabled) return 0; - ret = si_set_temperature_range(adev); - if (ret) - return ret; #if 0 //TODO ? si_dpm_powergate_uvd(adev, true); #endif