Message ID | 1690892953-25201-1-git-send-email-schakrabarti@linux.microsoft.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:918b:0:b0:3e4:2afc:c1 with SMTP id s11csp2652394vqg; Tue, 1 Aug 2023 06:04:12 -0700 (PDT) X-Google-Smtp-Source: APBJJlEd3jAC/D3N8CFXWsYIhqG73vzziBfvkfMgP8hrKMGHSVdIOaleWN9Kbr6Nr3VCtVq4EmJw X-Received: by 2002:a05:6a21:3e05:b0:133:2a73:fdd5 with SMTP id bk5-20020a056a213e0500b001332a73fdd5mr11963103pzc.32.1690895051901; Tue, 01 Aug 2023 06:04:11 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1690895051; cv=none; d=google.com; s=arc-20160816; b=KhobO/JgTDCUIKUx25Zodizv5G0Q1TTUowYKKAHuTq3n2Wtp0KGoHfMTpszf9fjbrZ FAdq69iNtogS2RuJ2rGm2vJcnmFz3oBHKJTElXGOug8V86avI+3rKA3OIpZ8NvxeKXP+ IQHwvknIkqW8pGcRjQPRKIevypnJaz5u1hl+/NSF8SVRj6NR+DSbUs3ySl8QZuyeuIRz BRkbHru+n+ZzqPLPB6cId7Jr4lK4J6gqAKQJ2Vbmddu9hFkircUaLTJ4IVqQ6cHFXmur OTLCgkOLVT2UfGlKHajsELoEeg+ckrauZG0+gIVgk68qIJjdGPXyPnF3DPbY9Z/PpAQ8 MB8g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:message-id:date:subject:cc:to:from :dkim-signature:dkim-filter; bh=Le5IWUlV+SOiU8nvbAjGXFDnfuBjbT8nhrykMj0k14k=; fh=CRXlYauY4/Fm9OvL6NavBYnlK8EESZebSVy0W94b4Sg=; b=IrlD+easiRVxrvnNpuMsUKGmsPm+NW9oJx4kCE8wIweysozRrnwXhm5hC2CjPqcqK2 l7M941yq0UdnNUGSz27mN6GuVVJ2hOkLwPqGodpWSfIZt7e927T/2k/hpm/gZG3RZPr8 VM8NPrXkWKwfXCQ/TO+a1SP30gvrpiAdkjUtF1ewpGnF2lEDaBr4+SNLE400stEj9hmp SUOjrt9nuEJBMV28lGVNdGRG+E/rTeBW8QX+SXkjx2n+Ds6xqng5rPpM2wEWeh+nlTyk BktcYS/qosbFHFrc5cz6of3BdEKHbSbww1I+AgM00hfO31KJSgK1E/izFMTnp3jZ+zmP Yo8A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linux.microsoft.com header.s=default header.b=QbxCkPlX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.microsoft.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id q11-20020a056a00150b00b006871fa2ee54si5367214pfu.99.2023.08.01.06.03.58; Tue, 01 Aug 2023 06:04:11 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linux.microsoft.com header.s=default header.b=QbxCkPlX; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.microsoft.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232773AbjHAM37 (ORCPT <rfc822;maxi.paulin@gmail.com> + 99 others); Tue, 1 Aug 2023 08:29:59 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38110 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232579AbjHAM35 (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Tue, 1 Aug 2023 08:29:57 -0400 Received: from linux.microsoft.com (linux.microsoft.com [13.77.154.182]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 399961FC6; Tue, 1 Aug 2023 05:29:56 -0700 (PDT) Received: by linux.microsoft.com (Postfix, from userid 1099) id 87B5A238AF50; Tue, 1 Aug 2023 05:29:55 -0700 (PDT) DKIM-Filter: OpenDKIM Filter v2.11.0 linux.microsoft.com 87B5A238AF50 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.microsoft.com; s=default; t=1690892995; bh=Le5IWUlV+SOiU8nvbAjGXFDnfuBjbT8nhrykMj0k14k=; h=From:To:Cc:Subject:Date:From; b=QbxCkPlXOL/LLmKwjN3yXH6ja2Lg3P8hNezx5avxJmSJyLmBcOP9KapwTglQPrWgI rRV1trxCqelfaklwj2dNcuOWZSeI38r8SfjIGl+2JI/8e3XBUHCvdcdwhMbb5c0Oaf TOxfJApD/w9ureNlZ3mTBBTnR8k+zD5B2SV+Tp7M= From: Souradeep Chakrabarti <schakrabarti@linux.microsoft.com> To: kys@microsoft.com, haiyangz@microsoft.com, wei.liu@kernel.org, decui@microsoft.com, davem@davemloft.net, edumazet@google.com, kuba@kernel.org, pabeni@redhat.com, longli@microsoft.com, sharmaajay@microsoft.com, leon@kernel.org, cai.huoqing@linux.dev, ssengar@linux.microsoft.com, vkuznets@redhat.com, tglx@linutronix.de, linux-hyperv@vger.kernel.org, netdev@vger.kernel.org, linux-kernel@vger.kernel.org, linux-rdma@vger.kernel.org Cc: schakrabarti@microsoft.com, Souradeep Chakrabarti <schakrabarti@linux.microsoft.com>, stable@vger.kernel.org Subject: [PATCH V7 net] net: mana: Fix MANA VF unload when hardware is Date: Tue, 1 Aug 2023 05:29:13 -0700 Message-Id: <1690892953-25201-1-git-send-email-schakrabarti@linux.microsoft.com> X-Mailer: git-send-email 1.8.3.1 X-Spam-Status: No, score=-17.5 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_BLOCKED, SPF_HELO_PASS,SPF_PASS,T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL, USER_IN_DEF_SPF_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1773031969834528225 X-GMAIL-MSGID: 1773031969834528225 |
Series |
[V7,net] net: mana: Fix MANA VF unload when hardware is
|
|
Commit Message
Souradeep Chakrabarti
Aug. 1, 2023, 12:29 p.m. UTC
When unloading the MANA driver, mana_dealloc_queues() waits for the MANA
hardware to complete any inflight packets and set the pending send count
to zero. But if the hardware has failed, mana_dealloc_queues()
could wait forever.
Fix this by adding a timeout to the wait. Set the timeout to 120 seconds,
which is a somewhat arbitrary value that is more than long enough for
functional hardware to complete any sends.
Cc: stable@vger.kernel.org
Fixes: ca9c54d2d6a5 ("net: mana: Add a driver for Microsoft Azure Network Adapter (MANA)")
Signed-off-by: Souradeep Chakrabarti <schakrabarti@linux.microsoft.com>
---
V6 -> V7:
* Optimized the while loop for freeing skb.
V5 -> V6:
* Added pcie_flr to reset the pci after timeout.
* Fixed the position of changelog.
* Removed unused variable like cq.
V4 -> V5:
* Added fixes tag
* Changed the usleep_range from static to incremental value.
* Initialized timeout in the begining.
V3 -> V4:
* Removed the unnecessary braces from mana_dealloc_queues().
V2 -> V3:
* Removed the unnecessary braces from mana_dealloc_queues().
V1 -> V2:
* Added net branch
* Removed the typecasting to (struct mana_context*) of void pointer
* Repositioned timeout variable in mana_dealloc_queues()
* Repositioned vf_unload_timeout in mana_context struct, to utilise the
6 bytes hole
---
drivers/net/ethernet/microsoft/mana/mana_en.c | 37 +++++++++++++++++--
1 file changed, 33 insertions(+), 4 deletions(-)
Comments
On Tue, Aug 01, 2023 at 05:29:13AM -0700, Souradeep Chakrabarti wrote: ... Hi Souradeep, > + for (i = 0; i < apc->num_queues; i++) { > + txq = &apc->tx_qp[i].txq; > + while (skb = skb_dequeue(&txq->pending_skbs)) { W=1 builds with both clang-16 and gcc-12 complain that they would like an extra set of parentheses around an assignment used as a truth value. > + mana_unmap_skb(skb, apc); > + dev_consume_skb_any(skb); > + } > + atomic_set(&txq->pending_sends, 0); > + } > /* We're 100% sure the queues can no longer be woken up, because > * we're sure now mana_poll_tx_cq() can't be running. > */ > -- > 2.34.1 > >
On 8/1/2023 5:29 AM, Souradeep Chakrabarti wrote: > When unloading the MANA driver, mana_dealloc_queues() waits for the MANA > hardware to complete any inflight packets and set the pending send count > to zero. But if the hardware has failed, mana_dealloc_queues() > could wait forever. > > Fix this by adding a timeout to the wait. Set the timeout to 120 seconds, tx timeout in stack defaults to 5 seconds, do you not have that on? What happens when you start getting resets while unloading? > which is a somewhat arbitrary value that is more than long enough for > functional hardware to complete any sends. I'd say 2 or 5 seconds is probably plenty of time to hang up a driver unload. > > Cc: stable@vger.kernel.org > Fixes: ca9c54d2d6a5 ("net: mana: Add a driver for Microsoft Azure Network Adapter (MANA)") > > Signed-off-by: Souradeep Chakrabarti <schakrabarti@linux.microsoft.com> keep s-o-b and other trailers together please, no spaces, it messes up git and doesn't conform to kernel standards. > --- > V6 -> V7: > * Optimized the while loop for freeing skb. > > V5 -> V6: > * Added pcie_flr to reset the pci after timeout. > * Fixed the position of changelog. > * Removed unused variable like cq. > > V4 -> V5: > * Added fixes tag > * Changed the usleep_range from static to incremental value. > * Initialized timeout in the begining. > > V3 -> V4: > * Removed the unnecessary braces from mana_dealloc_queues(). > > V2 -> V3: > * Removed the unnecessary braces from mana_dealloc_queues(). > > V1 -> V2: > * Added net branch > * Removed the typecasting to (struct mana_context*) of void pointer > * Repositioned timeout variable in mana_dealloc_queues() > * Repositioned vf_unload_timeout in mana_context struct, to utilise the > 6 bytes hole > --- > drivers/net/ethernet/microsoft/mana/mana_en.c | 37 +++++++++++++++++-- > 1 file changed, 33 insertions(+), 4 deletions(-) > > diff --git a/drivers/net/ethernet/microsoft/mana/mana_en.c b/drivers/net/ethernet/microsoft/mana/mana_en.c > index a499e460594b..3c5552a176d0 100644 > --- a/drivers/net/ethernet/microsoft/mana/mana_en.c > +++ b/drivers/net/ethernet/microsoft/mana/mana_en.c > @@ -8,6 +8,7 @@ > #include <linux/ethtool.h> > #include <linux/filter.h> > #include <linux/mm.h> > +#include <linux/pci.h> > > #include <net/checksum.h> > #include <net/ip6_checksum.h> > @@ -2345,9 +2346,12 @@ int mana_attach(struct net_device *ndev) > static int mana_dealloc_queues(struct net_device *ndev) > { > struct mana_port_context *apc = netdev_priv(ndev); > + unsigned long timeout = jiffies + 120 * HZ; > struct gdma_dev *gd = apc->ac->gdma_dev; > struct mana_txq *txq; > + struct sk_buff *skb; > int i, err; > + u32 tsleep; > > if (apc->port_is_up) > return -EINVAL; > @@ -2363,15 +2367,40 @@ static int mana_dealloc_queues(struct net_device *ndev) > * to false, but it doesn't matter since mana_start_xmit() drops any > * new packets due to apc->port_is_up being false. > * > - * Drain all the in-flight TX packets > + * Drain all the in-flight TX packets. > + * A timeout of 120 seconds for all the queues is used. > + * This will break the while loop when h/w is not responding. > + * This value of 120 has been decided here considering max > + * number of queues. > */ > + > for (i = 0; i < apc->num_queues; i++) { > txq = &apc->tx_qp[i].txq; > - > - while (atomic_read(&txq->pending_sends) > 0) > - usleep_range(1000, 2000); > + tsleep = 1000; > + while (atomic_read(&txq->pending_sends) > 0 && > + time_before(jiffies, timeout)) { > + usleep_range(tsleep, tsleep + 1000); > + tsleep <<= 1; > + } > + if (atomic_read(&txq->pending_sends)) { > + err = pcie_flr(to_pci_dev(gd->gdma_context->dev)); > + if (err) { > + netdev_err(ndev, "flr failed %d with %d pkts pending in txq %u\n", > + err, atomic_read(&txq->pending_sends), > + txq->gdma_txq_id); > + } > + break; > + } > } > > + for (i = 0; i < apc->num_queues; i++) { > + txq = &apc->tx_qp[i].txq; > + while (skb = skb_dequeue(&txq->pending_skbs)) { > + mana_unmap_skb(skb, apc); > + dev_consume_skb_any(skb); dev_kfree_skb_any() would be more correct here since this is an error path and the transmit is presumed dropped, isn't it? > + } > + atomic_set(&txq->pending_sends, 0); > + } > /* We're 100% sure the queues can no longer be woken up, because > * we're sure now mana_poll_tx_cq() can't be running. > */
>-----Original Message----- >From: Jesse Brandeburg <jesse.brandeburg@intel.com> >Sent: Tuesday, August 1, 2023 11:34 PM >To: Souradeep Chakrabarti <schakrabarti@linux.microsoft.com>; KY Srinivasan ><kys@microsoft.com>; Haiyang Zhang <haiyangz@microsoft.com>; >wei.liu@kernel.org; Dexuan Cui <decui@microsoft.com>; >davem@davemloft.net; edumazet@google.com; kuba@kernel.org; >pabeni@redhat.com; Long Li <longli@microsoft.com>; Ajay Sharma ><sharmaajay@microsoft.com>; leon@kernel.org; cai.huoqing@linux.dev; >ssengar@linux.microsoft.com; vkuznets <vkuznets@redhat.com>; >tglx@linutronix.de; linux-hyperv@vger.kernel.org; netdev@vger.kernel.org; >linux-kernel@vger.kernel.org; linux-rdma@vger.kernel.org >Cc: Souradeep Chakrabarti <schakrabarti@microsoft.com>; >stable@vger.kernel.org >Subject: [EXTERNAL] Re: [PATCH V7 net] net: mana: Fix MANA VF unload when >hardware is > >On 8/1/2023 5:29 AM, Souradeep Chakrabarti wrote: >> When unloading the MANA driver, mana_dealloc_queues() waits for the >> MANA hardware to complete any inflight packets and set the pending >> send count to zero. But if the hardware has failed, >> mana_dealloc_queues() could wait forever. >> >> Fix this by adding a timeout to the wait. Set the timeout to 120 >> seconds, > >tx timeout in stack defaults to 5 seconds, do you not have that on? What >happens when you start getting resets while unloading? > >> which is a somewhat arbitrary value that is more than long enough for >> functional hardware to complete any sends. > >I'd say 2 or 5 seconds is probably plenty of time to hang up a driver unload. > Thank you for the comment. This was already discussed in V4. I am just sharing the summary here: This waiting time is usually much shorter than 120 sec. The long wait only happens in rare and unexpected NIC HW non-responding cases. At that point, we don't actually care if the pending packets are sent or not. But if we free the queues too soon, and the HW is slow for unexpected reasons, a delayed completion notice will DMA into the freed memory and cause corruption. That's why we have a longer waiting time. >> >> Cc: stable@vger.kernel.org >> Fixes: ca9c54d2d6a5 ("net: mana: Add a driver for Microsoft Azure >> Network Adapter (MANA)") >> >> Signed-off-by: Souradeep Chakrabarti >> <schakrabarti@linux.microsoft.com> > >keep s-o-b and other trailers together please, no spaces, it messes up git and >doesn't conform to kernel standards. > > >> --- >> V6 -> V7: >> * Optimized the while loop for freeing skb. >> >> V5 -> V6: >> * Added pcie_flr to reset the pci after timeout. >> * Fixed the position of changelog. >> * Removed unused variable like cq. >> >> V4 -> V5: >> * Added fixes tag >> * Changed the usleep_range from static to incremental value. >> * Initialized timeout in the begining. >> >> V3 -> V4: >> * Removed the unnecessary braces from mana_dealloc_queues(). >> >> V2 -> V3: >> * Removed the unnecessary braces from mana_dealloc_queues(). >> >> V1 -> V2: >> * Added net branch >> * Removed the typecasting to (struct mana_context*) of void pointer >> * Repositioned timeout variable in mana_dealloc_queues() >> * Repositioned vf_unload_timeout in mana_context struct, to utilise >> the >> 6 bytes hole >> --- >> drivers/net/ethernet/microsoft/mana/mana_en.c | 37 >> +++++++++++++++++-- >> 1 file changed, 33 insertions(+), 4 deletions(-) >> >> diff --git a/drivers/net/ethernet/microsoft/mana/mana_en.c >> b/drivers/net/ethernet/microsoft/mana/mana_en.c >> index a499e460594b..3c5552a176d0 100644 >> --- a/drivers/net/ethernet/microsoft/mana/mana_en.c >> +++ b/drivers/net/ethernet/microsoft/mana/mana_en.c >> @@ -8,6 +8,7 @@ >> #include <linux/ethtool.h> >> #include <linux/filter.h> >> #include <linux/mm.h> >> +#include <linux/pci.h> >> >> #include <net/checksum.h> >> #include <net/ip6_checksum.h> >> @@ -2345,9 +2346,12 @@ int mana_attach(struct net_device *ndev) >> static int mana_dealloc_queues(struct net_device *ndev) { >> struct mana_port_context *apc = netdev_priv(ndev); >> + unsigned long timeout = jiffies + 120 * HZ; >> struct gdma_dev *gd = apc->ac->gdma_dev; >> struct mana_txq *txq; >> + struct sk_buff *skb; >> int i, err; >> + u32 tsleep; >> >> if (apc->port_is_up) >> return -EINVAL; >> @@ -2363,15 +2367,40 @@ static int mana_dealloc_queues(struct >net_device *ndev) >> * to false, but it doesn't matter since mana_start_xmit() drops any >> * new packets due to apc->port_is_up being false. >> * >> - * Drain all the in-flight TX packets >> + * Drain all the in-flight TX packets. >> + * A timeout of 120 seconds for all the queues is used. >> + * This will break the while loop when h/w is not responding. >> + * This value of 120 has been decided here considering max >> + * number of queues. >> */ >> + >> for (i = 0; i < apc->num_queues; i++) { >> txq = &apc->tx_qp[i].txq; >> - >> - while (atomic_read(&txq->pending_sends) > 0) >> - usleep_range(1000, 2000); >> + tsleep = 1000; >> + while (atomic_read(&txq->pending_sends) > 0 && >> + time_before(jiffies, timeout)) { >> + usleep_range(tsleep, tsleep + 1000); >> + tsleep <<= 1; >> + } >> + if (atomic_read(&txq->pending_sends)) { >> + err = pcie_flr(to_pci_dev(gd->gdma_context->dev)); >> + if (err) { >> + netdev_err(ndev, "flr failed %d with %d pkts >pending in txq %u\n", >> + err, atomic_read(&txq- >>pending_sends), >> + txq->gdma_txq_id); >> + } >> + break; >> + } >> } >> >> + for (i = 0; i < apc->num_queues; i++) { >> + txq = &apc->tx_qp[i].txq; >> + while (skb = skb_dequeue(&txq->pending_skbs)) { >> + mana_unmap_skb(skb, apc); >> + dev_consume_skb_any(skb); > >dev_kfree_skb_any() would be more correct here since this is an error path and >the transmit is presumed dropped, isn't it? Yes, dev_kfree_skb_any() will be a better approach in this scenario. Will change it in next version. > > >> + } >> + atomic_set(&txq->pending_sends, 0); >> + } >> /* We're 100% sure the queues can no longer be woken up, because >> * we're sure now mana_poll_tx_cq() can't be running. >> */
>-----Original Message----- >From: Simon Horman <horms@kernel.org> >Sent: Tuesday, August 1, 2023 9:01 PM >To: Souradeep Chakrabarti <schakrabarti@linux.microsoft.com> >Cc: KY Srinivasan <kys@microsoft.com>; Haiyang Zhang ><haiyangz@microsoft.com>; wei.liu@kernel.org; Dexuan Cui ><decui@microsoft.com>; davem@davemloft.net; edumazet@google.com; >kuba@kernel.org; pabeni@redhat.com; Long Li <longli@microsoft.com>; Ajay >Sharma <sharmaajay@microsoft.com>; leon@kernel.org; >cai.huoqing@linux.dev; ssengar@linux.microsoft.com; vkuznets ><vkuznets@redhat.com>; tglx@linutronix.de; linux-hyperv@vger.kernel.org; >netdev@vger.kernel.org; linux-kernel@vger.kernel.org; linux- >rdma@vger.kernel.org; Souradeep Chakrabarti ><schakrabarti@microsoft.com>; stable@vger.kernel.org >Subject: [EXTERNAL] Re: [PATCH V7 net] net: mana: Fix MANA VF unload when >hardware is > >On Tue, Aug 01, 2023 at 05:29:13AM -0700, Souradeep Chakrabarti wrote: > >... > >Hi Souradeep, > > >> + for (i = 0; i < apc->num_queues; i++) { >> + txq = &apc->tx_qp[i].txq; >> + while (skb = skb_dequeue(&txq->pending_skbs)) { > >W=1 builds with both clang-16 and gcc-12 complain that they would like an >extra set of parentheses around an assignment used as a truth value. Thanks for letting me know. I will fix it in next version. > >> + mana_unmap_skb(skb, apc); >> + dev_consume_skb_any(skb); >> + } >> + atomic_set(&txq->pending_sends, 0); >> + } >> /* We're 100% sure the queues can no longer be woken up, because >> * we're sure now mana_poll_tx_cq() can't be running. >> */ >> -- >> 2.34.1 >> >>
On Wed, Aug 02, 2023 at 10:57:52AM +0530, Kalesh Anakkur Purayil wrote: > Hi Souradeep, > > It looks like the subject line is not complete. I could see "net: mana: Fix > MANA VF unload when hardware is". > > Is that correct? > > Regards, > Kalesh > Yes, it got truncated. Will fix it in next version. > On Wed, Aug 2, 2023 at 12:29 AM Souradeep Chakrabarti < > schakrabarti@microsoft.com> wrote: > > > > > > > >-----Original Message----- > > >From: Simon Horman <horms@kernel.org> > > >Sent: Tuesday, August 1, 2023 9:01 PM > > >To: Souradeep Chakrabarti <schakrabarti@linux.microsoft.com> > > >Cc: KY Srinivasan <kys@microsoft.com>; Haiyang Zhang > > ><haiyangz@microsoft.com>; wei.liu@kernel.org; Dexuan Cui > > ><decui@microsoft.com>; davem@davemloft.net; edumazet@google.com; > > >kuba@kernel.org; pabeni@redhat.com; Long Li <longli@microsoft.com>; Ajay > > >Sharma <sharmaajay@microsoft.com>; leon@kernel.org; > > >cai.huoqing@linux.dev; ssengar@linux.microsoft.com; vkuznets > > ><vkuznets@redhat.com>; tglx@linutronix.de; linux-hyperv@vger.kernel.org; > > >netdev@vger.kernel.org; linux-kernel@vger.kernel.org; linux- > > >rdma@vger.kernel.org; Souradeep Chakrabarti > > ><schakrabarti@microsoft.com>; stable@vger.kernel.org > > >Subject: [EXTERNAL] Re: [PATCH V7 net] net: mana: Fix MANA VF unload when > > >hardware is > > > > > >On Tue, Aug 01, 2023 at 05:29:13AM -0700, Souradeep Chakrabarti wrote: > > > > > >... > > > > > >Hi Souradeep, > > > > > > > > >> + for (i = 0; i < apc->num_queues; i++) { > > >> + txq = &apc->tx_qp[i].txq; > > >> + while (skb = skb_dequeue(&txq->pending_skbs)) { > > > > > >W=1 builds with both clang-16 and gcc-12 complain that they would like an > > >extra set of parentheses around an assignment used as a truth value. > > Thanks for letting me know. I will fix it in next version. > > > > > >> + mana_unmap_skb(skb, apc); > > >> + dev_consume_skb_any(skb); > > >> + } > > >> + atomic_set(&txq->pending_sends, 0); > > >> + } > > >> /* We're 100% sure the queues can no longer be woken up, because > > >> * we're sure now mana_poll_tx_cq() can't be running. > > >> */ > > >> -- > > >> 2.34.1 > > >> > > >> > > > > > > -- > Regards, > Kalesh A P
Hi Souradeep, kernel test robot noticed the following build warnings: [auto build test WARNING on net/main] url: https://github.com/intel-lab-lkp/linux/commits/Souradeep-Chakrabarti/net-mana-Fix-MANA-VF-unload-when-hardware-is/20230801-203141 base: net/main patch link: https://lore.kernel.org/r/1690892953-25201-1-git-send-email-schakrabarti%40linux.microsoft.com patch subject: [PATCH V7 net] net: mana: Fix MANA VF unload when hardware is config: x86_64-allyesconfig (https://download.01.org/0day-ci/archive/20230802/202308021532.8iYkExDh-lkp@intel.com/config) compiler: gcc-12 (Debian 12.2.0-14) 12.2.0 reproduce: (https://download.01.org/0day-ci/archive/20230802/202308021532.8iYkExDh-lkp@intel.com/reproduce) If you fix the issue in a separate patch/commit (i.e. not just a new version of the same patch/commit), kindly add following tags | Reported-by: kernel test robot <lkp@intel.com> | Closes: https://lore.kernel.org/oe-kbuild-all/202308021532.8iYkExDh-lkp@intel.com/ All warnings (new ones prefixed by >>): drivers/net/ethernet/microsoft/mana/mana_en.c: In function 'mana_dealloc_queues': >> drivers/net/ethernet/microsoft/mana/mana_en.c:2398:24: warning: suggest parentheses around assignment used as truth value [-Wparentheses] 2398 | while (skb = skb_dequeue(&txq->pending_skbs)) { | ^~~ vim +2398 drivers/net/ethernet/microsoft/mana/mana_en.c 2345 2346 static int mana_dealloc_queues(struct net_device *ndev) 2347 { 2348 struct mana_port_context *apc = netdev_priv(ndev); 2349 unsigned long timeout = jiffies + 120 * HZ; 2350 struct gdma_dev *gd = apc->ac->gdma_dev; 2351 struct mana_txq *txq; 2352 struct sk_buff *skb; 2353 int i, err; 2354 u32 tsleep; 2355 2356 if (apc->port_is_up) 2357 return -EINVAL; 2358 2359 mana_chn_setxdp(apc, NULL); 2360 2361 if (gd->gdma_context->is_pf) 2362 mana_pf_deregister_filter(apc); 2363 2364 /* No packet can be transmitted now since apc->port_is_up is false. 2365 * There is still a tiny chance that mana_poll_tx_cq() can re-enable 2366 * a txq because it may not timely see apc->port_is_up being cleared 2367 * to false, but it doesn't matter since mana_start_xmit() drops any 2368 * new packets due to apc->port_is_up being false. 2369 * 2370 * Drain all the in-flight TX packets. 2371 * A timeout of 120 seconds for all the queues is used. 2372 * This will break the while loop when h/w is not responding. 2373 * This value of 120 has been decided here considering max 2374 * number of queues. 2375 */ 2376 2377 for (i = 0; i < apc->num_queues; i++) { 2378 txq = &apc->tx_qp[i].txq; 2379 tsleep = 1000; 2380 while (atomic_read(&txq->pending_sends) > 0 && 2381 time_before(jiffies, timeout)) { 2382 usleep_range(tsleep, tsleep + 1000); 2383 tsleep <<= 1; 2384 } 2385 if (atomic_read(&txq->pending_sends)) { 2386 err = pcie_flr(to_pci_dev(gd->gdma_context->dev)); 2387 if (err) { 2388 netdev_err(ndev, "flr failed %d with %d pkts pending in txq %u\n", 2389 err, atomic_read(&txq->pending_sends), 2390 txq->gdma_txq_id); 2391 } 2392 break; 2393 } 2394 } 2395 2396 for (i = 0; i < apc->num_queues; i++) { 2397 txq = &apc->tx_qp[i].txq; > 2398 while (skb = skb_dequeue(&txq->pending_skbs)) { 2399 mana_unmap_skb(skb, apc); 2400 dev_consume_skb_any(skb); 2401 } 2402 atomic_set(&txq->pending_sends, 0); 2403 } 2404 /* We're 100% sure the queues can no longer be woken up, because 2405 * we're sure now mana_poll_tx_cq() can't be running. 2406 */ 2407 2408 apc->rss_state = TRI_STATE_FALSE; 2409 err = mana_config_rss(apc, TRI_STATE_FALSE, false, false); 2410 if (err) { 2411 netdev_err(ndev, "Failed to disable vPort: %d\n", err); 2412 return err; 2413 } 2414 2415 mana_destroy_vport(apc); 2416 2417 return 0; 2418 } 2419
diff --git a/drivers/net/ethernet/microsoft/mana/mana_en.c b/drivers/net/ethernet/microsoft/mana/mana_en.c index a499e460594b..3c5552a176d0 100644 --- a/drivers/net/ethernet/microsoft/mana/mana_en.c +++ b/drivers/net/ethernet/microsoft/mana/mana_en.c @@ -8,6 +8,7 @@ #include <linux/ethtool.h> #include <linux/filter.h> #include <linux/mm.h> +#include <linux/pci.h> #include <net/checksum.h> #include <net/ip6_checksum.h> @@ -2345,9 +2346,12 @@ int mana_attach(struct net_device *ndev) static int mana_dealloc_queues(struct net_device *ndev) { struct mana_port_context *apc = netdev_priv(ndev); + unsigned long timeout = jiffies + 120 * HZ; struct gdma_dev *gd = apc->ac->gdma_dev; struct mana_txq *txq; + struct sk_buff *skb; int i, err; + u32 tsleep; if (apc->port_is_up) return -EINVAL; @@ -2363,15 +2367,40 @@ static int mana_dealloc_queues(struct net_device *ndev) * to false, but it doesn't matter since mana_start_xmit() drops any * new packets due to apc->port_is_up being false. * - * Drain all the in-flight TX packets + * Drain all the in-flight TX packets. + * A timeout of 120 seconds for all the queues is used. + * This will break the while loop when h/w is not responding. + * This value of 120 has been decided here considering max + * number of queues. */ + for (i = 0; i < apc->num_queues; i++) { txq = &apc->tx_qp[i].txq; - - while (atomic_read(&txq->pending_sends) > 0) - usleep_range(1000, 2000); + tsleep = 1000; + while (atomic_read(&txq->pending_sends) > 0 && + time_before(jiffies, timeout)) { + usleep_range(tsleep, tsleep + 1000); + tsleep <<= 1; + } + if (atomic_read(&txq->pending_sends)) { + err = pcie_flr(to_pci_dev(gd->gdma_context->dev)); + if (err) { + netdev_err(ndev, "flr failed %d with %d pkts pending in txq %u\n", + err, atomic_read(&txq->pending_sends), + txq->gdma_txq_id); + } + break; + } } + for (i = 0; i < apc->num_queues; i++) { + txq = &apc->tx_qp[i].txq; + while (skb = skb_dequeue(&txq->pending_skbs)) { + mana_unmap_skb(skb, apc); + dev_consume_skb_any(skb); + } + atomic_set(&txq->pending_sends, 0); + } /* We're 100% sure the queues can no longer be woken up, because * we're sure now mana_poll_tx_cq() can't be running. */