[v5,6/7] r8169: Coalesce mac ocp write and modify for 8125 and 8125B start to reduce spinlocks
Message ID | 20231029183600.451694-6-mirsad.todorovac@alu.unizg.hr |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:d641:0:b0:403:3b70:6f57 with SMTP id cy1csp1811588vqb; Sun, 29 Oct 2023 11:44:34 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFFM3Tqn4F5g0/mJrzZOSGvbDrpJqBRwNpX+lwWKlcUsfNjj2iZK7QGs1On9POOATMGnx8u X-Received: by 2002:a05:6871:60d:b0:1ef:cedd:5c32 with SMTP id w13-20020a056871060d00b001efcedd5c32mr1825616oan.3.1698605074554; Sun, 29 Oct 2023 11:44:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698605074; cv=none; d=google.com; s=arc-20160816; b=qstjAiJYEgzW4nMscRPO7BmxFmqi93pSYmVpPUqyP1stSzD/O4C5p1mzzhn3Zd4Opa 8QWeyddiAN349zuHo9kl9HfeqvHEK2ikqbX/DwTOyb93RCDwhN/vgvUntQlPaAWmesdh iknQEuw7Gf/HhFyMVbIN2KMlbOWe2N1u5fETYQd2MdslW4OHaJmHPJUPGnolBbW1/MW2 JBZ0pqOeVeeXaMTuyDbeYFDhigJEvB5ffi4wCm/cHEKyHxVO71r8BpX+0cT+kZgcXowI EWosqE5DEFW95n6JAphBW85Z3M66EZAVlFnULO+GIYGYiCFw1eRh++jh+in3SkOiQaOb jVvA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature:dkim-signature; bh=Asl/6IUeLXRnDB289XIVsKuj7d8rg64WKlxgVsYJA7s=; fh=SsNJs69sMYlYTeWQYB3Y6uFVnk1BYEiSEalBcmc5mfg=; b=rPNWZpx5wU/APtWkd//Vth00mA7DY+pHWsPcEFYn8oEHPVqWixU79OqEuKwnpK/r5e xSuSGuLHpPzgVOWRowxg4wRwrbJ2uJaZTfsW+TopsvQJNwsZBDb/+fW0o05IJxyXIskh cT8gB+aJLkiI5Yt56/qScjHho0Fbh5Z0oexqwW6J3qOAntjxFqdze6naUCIcmOr/dCWF f9UxwlfJTwsdLtU9qaQlTBlwGT5E/vwnP3f3Gn7dKl2ygQgxNGe37GITj8bulfa6BYz2 1QaYfX+VMwfcWBdruQ/h6FyuXrc7O/o8CNI8whZnXBS1XLcXQ5L86+0uTEfV/mjT73CI iqwQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=I4oglD+g; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=VGwNHyBR; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.36 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alu.unizg.hr Received: from pete.vger.email (pete.vger.email. [23.128.96.36]) by mx.google.com with ESMTPS id n5-20020a632705000000b005a073e0cc9asi3994262pgn.803.2023.10.29.11.44.34 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 29 Oct 2023 11:44:34 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.36 as permitted sender) client-ip=23.128.96.36; Authentication-Results: mx.google.com; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=I4oglD+g; dkim=fail header.i=@alu.unizg.hr header.s=mail header.b=VGwNHyBR; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.36 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alu.unizg.hr Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by pete.vger.email (Postfix) with ESMTP id EB32C80621F9; Sun, 29 Oct 2023 11:44:30 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at pete.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230345AbjJ2SoD (ORCPT <rfc822;fengqi706@gmail.com> + 30 others); Sun, 29 Oct 2023 14:44:03 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53372 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230349AbjJ2SoC (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Sun, 29 Oct 2023 14:44:02 -0400 Received: from domac.alu.hr (domac.alu.unizg.hr [IPv6:2001:b68:2:2800::3]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5A8D8BD; Sun, 29 Oct 2023 11:43:59 -0700 (PDT) Received: from localhost (localhost [127.0.0.1]) by domac.alu.hr (Postfix) with ESMTP id 16B6860173; Sun, 29 Oct 2023 19:43:58 +0100 (CET) DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=alu.unizg.hr; s=mail; t=1698605038; bh=kTrKI+RYDHwLvsOjwX6EC4uYyc8dF8fc3KgwnN4pF48=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=I4oglD+gqpjI8GSCMer9UN+MuG+hXIz8uXZFJJEw3nnWIkPXGyW2n8wjDEb8RgUjx huZ1sKRup2wQgqUoj/BZClV74pLhoxZrxz2PNI0h67YQ3cq42uN5qy2/G8+Kec6TXW rc6LkYoCauAeYpYSERe3YJ34vehPhQvauhbI7UNto+Dpvg+j67Q2neuVMOyCSwYamG FrXZ1HsI+wlpjYeGfRyPAchQMlqG+tIBET+3cdqTBAyw0b9ZOCGWZvDqrr2N55UlhN 4gkdROdFyS2I7DSllbDltaSNFsL9nc4wutWOZtMnKIVGBWgWqqL5YvfqjbJpB1wfeM pmNxhR48JCzjw== X-Virus-Scanned: Debian amavisd-new at domac.alu.hr Received: from domac.alu.hr ([127.0.0.1]) by localhost (domac.alu.hr [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id yBSS37-Hzjdw; Sun, 29 Oct 2023 19:43:55 +0100 (CET) Received: from defiant.home (78-3-40-247.adsl.net.t-com.hr [78.3.40.247]) by domac.alu.hr (Postfix) with ESMTPSA id 6A74A60182; Sun, 29 Oct 2023 19:43:55 +0100 (CET) DKIM-Signature: v=1; a=rsa-sha256; c=simple/simple; d=alu.unizg.hr; s=mail; t=1698605035; bh=kTrKI+RYDHwLvsOjwX6EC4uYyc8dF8fc3KgwnN4pF48=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=VGwNHyBRNsEywjflWi+92J6548s7A+Jtky6QkTblA5IKLHHu2reR8JaahD3hERPhf 9kYXlgVxy6ZOZhEg4HUckw7qMcxIggx25rgp6A9M+dznlRwgHPwTlleEv5BWcz+UQr 2aGJKQGNe+tB1oR2bGB3u3sHhAfsd9JA4DAv6Gwi3bzy1WM2BV1tDEQmZF5nugXFzx 0dczvI/gF8Oh57W3h7gUBOAL2J2YUMJmEZE0yzSMzkWZhA+oCoVTOqjTh2ayO8AQo3 Z+Tv1RPONMRfgPzYFjuI586MJBmSV5fkZpM8oXZ0DoXsyGntAeVnGbcNpvRb8Gkp9u AIeH/GtHsWUTQ== From: Mirsad Goran Todorovac <mirsad.todorovac@alu.unizg.hr> To: Jason Gunthorpe <jgg@ziepe.ca>, Joerg Roedel <jroedel@suse.de>, Lu Baolu <baolu.lu@linux.intel.com>, iommu@lists.linux.dev, linux-kernel@vger.kernel.org, netdev@vger.kernel.org Cc: Joerg Roedel <joro@8bytes.org>, Will Deacon <will@kernel.org>, Robin Murphy <robin.murphy@arm.com>, Heiner Kallweit <hkallweit1@gmail.com>, nic_swsd@realtek.com, "David S. Miller" <davem@davemloft.net>, Eric Dumazet <edumazet@google.com>, Jakub Kicinski <kuba@kernel.org>, Paolo Abeni <pabeni@redhat.com>, Mirsad Goran Todorovac <mirsad.todorovac@alu.unizg.hr>, Marco Elver <elver@google.com> Subject: [PATCH v5 6/7] r8169: Coalesce mac ocp write and modify for 8125 and 8125B start to reduce spinlocks Date: Sun, 29 Oct 2023 19:36:05 +0100 Message-Id: <20231029183600.451694-6-mirsad.todorovac@alu.unizg.hr> X-Mailer: git-send-email 2.34.1 In-Reply-To: <20231029183600.451694-1-mirsad.todorovac@alu.unizg.hr> References: <20231029183600.451694-1-mirsad.todorovac@alu.unizg.hr> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-0.6 required=5.0 tests=DKIM_INVALID,DKIM_SIGNED, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on pete.vger.email Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (pete.vger.email [0.0.0.0]); Sun, 29 Oct 2023 11:44:31 -0700 (PDT) X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1781116514604537246 X-GMAIL-MSGID: 1781116514604537246 |
Series |
[v5,1/7] r8169: Add r8169_mac_ocp_(write|modify)_seq helpers to reduce spinlock contention
|
|
Commit Message
Mirsad Todorovac
Oct. 29, 2023, 6:36 p.m. UTC
Repeated calls to r8168_mac_ocp_write() and r8168_mac_ocp_modify() in
the startup of 8125 and 8125B involve implicit spin_lock_irqsave() and
spin_unlock_irqrestore() on each invocation.
Coalesced with the corresponding helpers r8168_mac_ocp_write_seq() and
r8168_mac_ocp_modify_seq() into sequential write or modidy with a sinqle
pair of spin_lock_irqsave() and spin_unlock_irqrestore(), these calls
reduce overall lock contention.
Fixes: f1bce4ad2f1ce ("r8169: add support for RTL8125")
Fixes: 0439297be9511 ("r8169: add support for RTL8125B")
Cc: Heiner Kallweit <hkallweit1@gmail.com>
Cc: Marco Elver <elver@google.com>
Cc: nic_swsd@realtek.com
Cc: "David S. Miller" <davem@davemloft.net>
Cc: Eric Dumazet <edumazet@google.com>
Cc: Jakub Kicinski <kuba@kernel.org>
Cc: Paolo Abeni <pabeni@redhat.com>
Cc: netdev@vger.kernel.org
Cc: linux-kernel@vger.kernel.org
Link: https://lore.kernel.org/lkml/20231028005153.2180411-1-mirsad.todorovac@alu.unizg.hr/
Link: https://lore.kernel.org/lkml/20231028110459.2644926-1-mirsad.todorovac@alu.unizg.hr/
Signed-off-by: Mirsad Goran Todorovac <mirsad.todorovac@alu.unizg.hr>
---
v5:
added unlocked primitives to allow mac ocs modify grouping
applied coalescing of mac ocp writes/modifies for 8168ep and 8117
some formatting fixes to please checkpatch.pl
v4:
fixed complaints as advised by Heiner and checkpatch.pl
split the patch into five sections to be more easily manipulated and reviewed
introduced r8168_mac_ocp_write_seq()
applied coalescing of mac ocp writes/modifies for 8168H, 8125 and 8125B
v3:
removed register/mask pair array sentinels, so using ARRAY_SIZE().
avoided duplication of RTL_W32() call code as advised by Heiner.
drivers/net/ethernet/realtek/r8169_main.c | 75 +++++++++++++----------
1 file changed, 44 insertions(+), 31 deletions(-)
Comments
On 29.10.2023 19:36, Mirsad Goran Todorovac wrote: > Repeated calls to r8168_mac_ocp_write() and r8168_mac_ocp_modify() in > the startup of 8125 and 8125B involve implicit spin_lock_irqsave() and > spin_unlock_irqrestore() on each invocation. > > Coalesced with the corresponding helpers r8168_mac_ocp_write_seq() and > r8168_mac_ocp_modify_seq() into sequential write or modidy with a sinqle > pair of spin_lock_irqsave() and spin_unlock_irqrestore(), these calls > reduce overall lock contention. > > Fixes: f1bce4ad2f1ce ("r8169: add support for RTL8125") > Fixes: 0439297be9511 ("r8169: add support for RTL8125B") > Cc: Heiner Kallweit <hkallweit1@gmail.com> > Cc: Marco Elver <elver@google.com> > Cc: nic_swsd@realtek.com > Cc: "David S. Miller" <davem@davemloft.net> > Cc: Eric Dumazet <edumazet@google.com> > Cc: Jakub Kicinski <kuba@kernel.org> > Cc: Paolo Abeni <pabeni@redhat.com> > Cc: netdev@vger.kernel.org > Cc: linux-kernel@vger.kernel.org > Link: https://lore.kernel.org/lkml/20231028005153.2180411-1-mirsad.todorovac@alu.unizg.hr/ > Link: https://lore.kernel.org/lkml/20231028110459.2644926-1-mirsad.todorovac@alu.unizg.hr/ > Signed-off-by: Mirsad Goran Todorovac <mirsad.todorovac@alu.unizg.hr> > --- > v5: > added unlocked primitives to allow mac ocs modify grouping > applied coalescing of mac ocp writes/modifies for 8168ep and 8117 > some formatting fixes to please checkpatch.pl > > v4: > fixed complaints as advised by Heiner and checkpatch.pl > split the patch into five sections to be more easily manipulated and reviewed > introduced r8168_mac_ocp_write_seq() > applied coalescing of mac ocp writes/modifies for 8168H, 8125 and 8125B > > v3: > removed register/mask pair array sentinels, so using ARRAY_SIZE(). > avoided duplication of RTL_W32() call code as advised by Heiner. > > drivers/net/ethernet/realtek/r8169_main.c | 75 +++++++++++++---------- > 1 file changed, 44 insertions(+), 31 deletions(-) > > diff --git a/drivers/net/ethernet/realtek/r8169_main.c b/drivers/net/ethernet/realtek/r8169_main.c > index 50fbacb05953..0778cd0ba2e0 100644 > --- a/drivers/net/ethernet/realtek/r8169_main.c > +++ b/drivers/net/ethernet/realtek/r8169_main.c > @@ -3553,6 +3553,28 @@ DECLARE_RTL_COND(rtl_mac_ocp_e00e_cond) > > static void rtl_hw_start_8125_common(struct rtl8169_private *tp) > { > + static const struct e_info_regmaskset e_info_8125_common_1[] = { > + { 0xd3e2, 0x0fff, 0x03a9 }, > + { 0xd3e4, 0x00ff, 0x0000 }, > + { 0xe860, 0x0000, 0x0080 }, > + }; > + > + static const struct e_info_regmaskset e_info_8125_common_2[] = { > + { 0xc0b4, 0x0000, 0x000c }, > + { 0xeb6a, 0x00ff, 0x0033 }, > + { 0xeb50, 0x03e0, 0x0040 }, > + { 0xe056, 0x00f0, 0x0030 }, > + { 0xe040, 0x1000, 0x0000 }, > + { 0xea1c, 0x0003, 0x0001 }, > + { 0xe0c0, 0x4f0f, 0x4403 }, > + { 0xe052, 0x0080, 0x0068 }, > + { 0xd430, 0x0fff, 0x047f }, > + { 0xea1c, 0x0004, 0x0000 }, > + { 0xeb54, 0x0000, 0x0001 }, > + }; > + > + unsigned long flags; > + > rtl_pcie_state_l2l3_disable(tp); > > RTL_W16(tp, 0x382, 0x221b); > @@ -3560,47 +3582,38 @@ static void rtl_hw_start_8125_common(struct rtl8169_private *tp) > RTL_W16(tp, 0x4800, 0); > > /* disable UPS */ > - r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); > + > + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); > + __r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); > > RTL_W8(tp, Config1, RTL_R8(tp, Config1) & ~0x10); > > - r8168_mac_ocp_write(tp, 0xc140, 0xffff); > - r8168_mac_ocp_write(tp, 0xc142, 0xffff); > + __r8168_mac_ocp_write(tp, 0xc140, 0xffff); > + __r8168_mac_ocp_write(tp, 0xc142, 0xffff); > > - r8168_mac_ocp_modify(tp, 0xd3e2, 0x0fff, 0x03a9); > - r8168_mac_ocp_modify(tp, 0xd3e4, 0x00ff, 0x0000); > - r8168_mac_ocp_modify(tp, 0xe860, 0x0000, 0x0080); > + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_1); > > /* disable new tx descriptor format */ > - r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); > + __r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); > > - if (tp->mac_version == RTL_GIGA_MAC_VER_63) > - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); > - else > - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); > + if (tp->mac_version == RTL_GIGA_MAC_VER_63) { > + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); > + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); > + } else { > + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); > + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); > + } > + > + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_2); > + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); > > - if (tp->mac_version == RTL_GIGA_MAC_VER_63) > - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); > - else > - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); > - > - r8168_mac_ocp_modify(tp, 0xc0b4, 0x0000, 0x000c); > - r8168_mac_ocp_modify(tp, 0xeb6a, 0x00ff, 0x0033); > - r8168_mac_ocp_modify(tp, 0xeb50, 0x03e0, 0x0040); > - r8168_mac_ocp_modify(tp, 0xe056, 0x00f0, 0x0030); > - r8168_mac_ocp_modify(tp, 0xe040, 0x1000, 0x0000); > - r8168_mac_ocp_modify(tp, 0xea1c, 0x0003, 0x0001); > - r8168_mac_ocp_modify(tp, 0xe0c0, 0x4f0f, 0x4403); > - r8168_mac_ocp_modify(tp, 0xe052, 0x0080, 0x0068); > - r8168_mac_ocp_modify(tp, 0xd430, 0x0fff, 0x047f); > - > - r8168_mac_ocp_modify(tp, 0xea1c, 0x0004, 0x0000); > - r8168_mac_ocp_modify(tp, 0xeb54, 0x0000, 0x0001); > udelay(1); > - r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); > - RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); > > - r8168_mac_ocp_write(tp, 0xe098, 0xc302); > + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); > + __r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); > + RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); > + __r8168_mac_ocp_write(tp, 0xe098, 0xc302); > + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); > > rtl_loop_wait_low(tp, &rtl_mac_ocp_e00e_cond, 1000, 10); > All this manual locking and unlocking makes the code harder to read and more error-prone. Maybe, as a rule of thumb: If you can replace a block with more than 10 mac ocp ops, then fine with me.
On 10/30/23 15:02, Heiner Kallweit wrote: > On 29.10.2023 19:36, Mirsad Goran Todorovac wrote: >> Repeated calls to r8168_mac_ocp_write() and r8168_mac_ocp_modify() in >> the startup of 8125 and 8125B involve implicit spin_lock_irqsave() and >> spin_unlock_irqrestore() on each invocation. >> >> Coalesced with the corresponding helpers r8168_mac_ocp_write_seq() and >> r8168_mac_ocp_modify_seq() into sequential write or modidy with a sinqle >> pair of spin_lock_irqsave() and spin_unlock_irqrestore(), these calls >> reduce overall lock contention. >> >> Fixes: f1bce4ad2f1ce ("r8169: add support for RTL8125") >> Fixes: 0439297be9511 ("r8169: add support for RTL8125B") >> Cc: Heiner Kallweit <hkallweit1@gmail.com> >> Cc: Marco Elver <elver@google.com> >> Cc: nic_swsd@realtek.com >> Cc: "David S. Miller" <davem@davemloft.net> >> Cc: Eric Dumazet <edumazet@google.com> >> Cc: Jakub Kicinski <kuba@kernel.org> >> Cc: Paolo Abeni <pabeni@redhat.com> >> Cc: netdev@vger.kernel.org >> Cc: linux-kernel@vger.kernel.org >> Link: https://lore.kernel.org/lkml/20231028005153.2180411-1-mirsad.todorovac@alu.unizg.hr/ >> Link: https://lore.kernel.org/lkml/20231028110459.2644926-1-mirsad.todorovac@alu.unizg.hr/ >> Signed-off-by: Mirsad Goran Todorovac <mirsad.todorovac@alu.unizg.hr> >> --- >> v5: >> added unlocked primitives to allow mac ocs modify grouping >> applied coalescing of mac ocp writes/modifies for 8168ep and 8117 >> some formatting fixes to please checkpatch.pl >> >> v4: >> fixed complaints as advised by Heiner and checkpatch.pl >> split the patch into five sections to be more easily manipulated and reviewed >> introduced r8168_mac_ocp_write_seq() >> applied coalescing of mac ocp writes/modifies for 8168H, 8125 and 8125B >> >> v3: >> removed register/mask pair array sentinels, so using ARRAY_SIZE(). >> avoided duplication of RTL_W32() call code as advised by Heiner. >> >> drivers/net/ethernet/realtek/r8169_main.c | 75 +++++++++++++---------- >> 1 file changed, 44 insertions(+), 31 deletions(-) >> >> diff --git a/drivers/net/ethernet/realtek/r8169_main.c b/drivers/net/ethernet/realtek/r8169_main.c >> index 50fbacb05953..0778cd0ba2e0 100644 >> --- a/drivers/net/ethernet/realtek/r8169_main.c >> +++ b/drivers/net/ethernet/realtek/r8169_main.c >> @@ -3553,6 +3553,28 @@ DECLARE_RTL_COND(rtl_mac_ocp_e00e_cond) >> >> static void rtl_hw_start_8125_common(struct rtl8169_private *tp) >> { >> + static const struct e_info_regmaskset e_info_8125_common_1[] = { >> + { 0xd3e2, 0x0fff, 0x03a9 }, >> + { 0xd3e4, 0x00ff, 0x0000 }, >> + { 0xe860, 0x0000, 0x0080 }, >> + }; >> + >> + static const struct e_info_regmaskset e_info_8125_common_2[] = { >> + { 0xc0b4, 0x0000, 0x000c }, >> + { 0xeb6a, 0x00ff, 0x0033 }, >> + { 0xeb50, 0x03e0, 0x0040 }, >> + { 0xe056, 0x00f0, 0x0030 }, >> + { 0xe040, 0x1000, 0x0000 }, >> + { 0xea1c, 0x0003, 0x0001 }, >> + { 0xe0c0, 0x4f0f, 0x4403 }, >> + { 0xe052, 0x0080, 0x0068 }, >> + { 0xd430, 0x0fff, 0x047f }, >> + { 0xea1c, 0x0004, 0x0000 }, >> + { 0xeb54, 0x0000, 0x0001 }, >> + }; >> + >> + unsigned long flags; >> + >> rtl_pcie_state_l2l3_disable(tp); >> >> RTL_W16(tp, 0x382, 0x221b); >> @@ -3560,47 +3582,38 @@ static void rtl_hw_start_8125_common(struct rtl8169_private *tp) >> RTL_W16(tp, 0x4800, 0); >> >> /* disable UPS */ >> - r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); >> + >> + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); >> + __r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); >> >> RTL_W8(tp, Config1, RTL_R8(tp, Config1) & ~0x10); >> >> - r8168_mac_ocp_write(tp, 0xc140, 0xffff); >> - r8168_mac_ocp_write(tp, 0xc142, 0xffff); >> + __r8168_mac_ocp_write(tp, 0xc140, 0xffff); >> + __r8168_mac_ocp_write(tp, 0xc142, 0xffff); >> >> - r8168_mac_ocp_modify(tp, 0xd3e2, 0x0fff, 0x03a9); >> - r8168_mac_ocp_modify(tp, 0xd3e4, 0x00ff, 0x0000); >> - r8168_mac_ocp_modify(tp, 0xe860, 0x0000, 0x0080); >> + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_1); >> >> /* disable new tx descriptor format */ >> - r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); >> + __r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); >> >> - if (tp->mac_version == RTL_GIGA_MAC_VER_63) >> - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); >> - else >> - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); >> + if (tp->mac_version == RTL_GIGA_MAC_VER_63) { >> + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); >> + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); >> + } else { >> + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); >> + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); >> + } >> + >> + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_2); >> + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); >> >> - if (tp->mac_version == RTL_GIGA_MAC_VER_63) >> - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); >> - else >> - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); >> - >> - r8168_mac_ocp_modify(tp, 0xc0b4, 0x0000, 0x000c); >> - r8168_mac_ocp_modify(tp, 0xeb6a, 0x00ff, 0x0033); >> - r8168_mac_ocp_modify(tp, 0xeb50, 0x03e0, 0x0040); >> - r8168_mac_ocp_modify(tp, 0xe056, 0x00f0, 0x0030); >> - r8168_mac_ocp_modify(tp, 0xe040, 0x1000, 0x0000); >> - r8168_mac_ocp_modify(tp, 0xea1c, 0x0003, 0x0001); >> - r8168_mac_ocp_modify(tp, 0xe0c0, 0x4f0f, 0x4403); >> - r8168_mac_ocp_modify(tp, 0xe052, 0x0080, 0x0068); >> - r8168_mac_ocp_modify(tp, 0xd430, 0x0fff, 0x047f); >> - >> - r8168_mac_ocp_modify(tp, 0xea1c, 0x0004, 0x0000); >> - r8168_mac_ocp_modify(tp, 0xeb54, 0x0000, 0x0001); [1] I think we have a candidate for a squeeze here in this driver. >> udelay(1); >> - r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); >> - RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); >> >> - r8168_mac_ocp_write(tp, 0xe098, 0xc302); >> + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); >> + __r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); >> + RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); >> + __r8168_mac_ocp_write(tp, 0xe098, 0xc302); >> + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); >> >> rtl_loop_wait_low(tp, &rtl_mac_ocp_e00e_cond, 1000, 10); >> > > All this manual locking and unlocking makes the code harder > to read and more error-prone. Maybe, as a rule of thumb: > If you can replace a block with more than 10 mac ocp ops, > then fine with me As I worked with another German developer, Mr. Frank Heckenbach from the GNU Pascal project, I know that Germans are pedantic and reliable :-) If this rtl_hw_start_8125_common() is called only once, then maybe every memory bus cycle isn't worth saving, and then maybe the additional complexity isn't worth adding (but it was fun doing, and it works with my NIC). AFAIK, a spin_lock_irqsave()/spin_unlock_irqrestore() isn't a free lunch as you know, and I read from the manuals that on modern CPUs a locked ADD $0, -128(%esp) or something takes about 50 clock cycles, in which all cores have to wait. Doing that in storm of 10 lock/unlock pairs amounts to 500 cycles or 125 ns in the best case on a 4 GHz CPU. But I trust that you as the maintainer have the big picture and greater insight in the actual hw. Elon Musk said that the greatest error in engineering is optimising stuff that never should have been written. As I like my RTL 8125b NIC to be fast (TM), I am not giving up on it. Just recalled that you mentioned that it supports RSS. Is it feasible to add it to the r8169 driver? It is OK for me that we have a formal and less formal mode of communication and switch between them. Thank you for all your quick reviews, much obliged, thank you for partially accepting the patches, but I think I need to sleep over it before submitting a new version. Regards, Mirsad Todorovac
On 30.10.2023 16:02, Mirsad Todorovac wrote: > > > On 10/30/23 15:02, Heiner Kallweit wrote: >> On 29.10.2023 19:36, Mirsad Goran Todorovac wrote: >>> Repeated calls to r8168_mac_ocp_write() and r8168_mac_ocp_modify() in >>> the startup of 8125 and 8125B involve implicit spin_lock_irqsave() and >>> spin_unlock_irqrestore() on each invocation. >>> >>> Coalesced with the corresponding helpers r8168_mac_ocp_write_seq() and >>> r8168_mac_ocp_modify_seq() into sequential write or modidy with a sinqle >>> pair of spin_lock_irqsave() and spin_unlock_irqrestore(), these calls >>> reduce overall lock contention. >>> >>> Fixes: f1bce4ad2f1ce ("r8169: add support for RTL8125") >>> Fixes: 0439297be9511 ("r8169: add support for RTL8125B") >>> Cc: Heiner Kallweit <hkallweit1@gmail.com> >>> Cc: Marco Elver <elver@google.com> >>> Cc: nic_swsd@realtek.com >>> Cc: "David S. Miller" <davem@davemloft.net> >>> Cc: Eric Dumazet <edumazet@google.com> >>> Cc: Jakub Kicinski <kuba@kernel.org> >>> Cc: Paolo Abeni <pabeni@redhat.com> >>> Cc: netdev@vger.kernel.org >>> Cc: linux-kernel@vger.kernel.org >>> Link: https://lore.kernel.org/lkml/20231028005153.2180411-1-mirsad.todorovac@alu.unizg.hr/ >>> Link: https://lore.kernel.org/lkml/20231028110459.2644926-1-mirsad.todorovac@alu.unizg.hr/ >>> Signed-off-by: Mirsad Goran Todorovac <mirsad.todorovac@alu.unizg.hr> >>> --- >>> v5: >>> added unlocked primitives to allow mac ocs modify grouping >>> applied coalescing of mac ocp writes/modifies for 8168ep and 8117 >>> some formatting fixes to please checkpatch.pl >>> >>> v4: >>> fixed complaints as advised by Heiner and checkpatch.pl >>> split the patch into five sections to be more easily manipulated and reviewed >>> introduced r8168_mac_ocp_write_seq() >>> applied coalescing of mac ocp writes/modifies for 8168H, 8125 and 8125B >>> >>> v3: >>> removed register/mask pair array sentinels, so using ARRAY_SIZE(). >>> avoided duplication of RTL_W32() call code as advised by Heiner. >>> >>> drivers/net/ethernet/realtek/r8169_main.c | 75 +++++++++++++---------- >>> 1 file changed, 44 insertions(+), 31 deletions(-) >>> >>> diff --git a/drivers/net/ethernet/realtek/r8169_main.c b/drivers/net/ethernet/realtek/r8169_main.c >>> index 50fbacb05953..0778cd0ba2e0 100644 >>> --- a/drivers/net/ethernet/realtek/r8169_main.c >>> +++ b/drivers/net/ethernet/realtek/r8169_main.c >>> @@ -3553,6 +3553,28 @@ DECLARE_RTL_COND(rtl_mac_ocp_e00e_cond) >>> static void rtl_hw_start_8125_common(struct rtl8169_private *tp) >>> { >>> + static const struct e_info_regmaskset e_info_8125_common_1[] = { >>> + { 0xd3e2, 0x0fff, 0x03a9 }, >>> + { 0xd3e4, 0x00ff, 0x0000 }, >>> + { 0xe860, 0x0000, 0x0080 }, >>> + }; >>> + >>> + static const struct e_info_regmaskset e_info_8125_common_2[] = { >>> + { 0xc0b4, 0x0000, 0x000c }, >>> + { 0xeb6a, 0x00ff, 0x0033 }, >>> + { 0xeb50, 0x03e0, 0x0040 }, >>> + { 0xe056, 0x00f0, 0x0030 }, >>> + { 0xe040, 0x1000, 0x0000 }, >>> + { 0xea1c, 0x0003, 0x0001 }, >>> + { 0xe0c0, 0x4f0f, 0x4403 }, >>> + { 0xe052, 0x0080, 0x0068 }, >>> + { 0xd430, 0x0fff, 0x047f }, >>> + { 0xea1c, 0x0004, 0x0000 }, >>> + { 0xeb54, 0x0000, 0x0001 }, >>> + }; >>> + >>> + unsigned long flags; >>> + >>> rtl_pcie_state_l2l3_disable(tp); >>> RTL_W16(tp, 0x382, 0x221b); >>> @@ -3560,47 +3582,38 @@ static void rtl_hw_start_8125_common(struct rtl8169_private *tp) >>> RTL_W16(tp, 0x4800, 0); >>> /* disable UPS */ >>> - r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); >>> + >>> + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); >>> + __r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); >>> RTL_W8(tp, Config1, RTL_R8(tp, Config1) & ~0x10); >>> - r8168_mac_ocp_write(tp, 0xc140, 0xffff); >>> - r8168_mac_ocp_write(tp, 0xc142, 0xffff); >>> + __r8168_mac_ocp_write(tp, 0xc140, 0xffff); >>> + __r8168_mac_ocp_write(tp, 0xc142, 0xffff); >>> - r8168_mac_ocp_modify(tp, 0xd3e2, 0x0fff, 0x03a9); >>> - r8168_mac_ocp_modify(tp, 0xd3e4, 0x00ff, 0x0000); >>> - r8168_mac_ocp_modify(tp, 0xe860, 0x0000, 0x0080); >>> + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_1); >>> /* disable new tx descriptor format */ >>> - r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); >>> + __r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); >>> - if (tp->mac_version == RTL_GIGA_MAC_VER_63) >>> - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); >>> - else >>> - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); >>> + if (tp->mac_version == RTL_GIGA_MAC_VER_63) { >>> + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); >>> + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); >>> + } else { >>> + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); >>> + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); >>> + } >>> + >>> + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_2); >>> + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); >>> - if (tp->mac_version == RTL_GIGA_MAC_VER_63) >>> - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); >>> - else >>> - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); >>> - >>> - r8168_mac_ocp_modify(tp, 0xc0b4, 0x0000, 0x000c); >>> - r8168_mac_ocp_modify(tp, 0xeb6a, 0x00ff, 0x0033); >>> - r8168_mac_ocp_modify(tp, 0xeb50, 0x03e0, 0x0040); >>> - r8168_mac_ocp_modify(tp, 0xe056, 0x00f0, 0x0030); >>> - r8168_mac_ocp_modify(tp, 0xe040, 0x1000, 0x0000); >>> - r8168_mac_ocp_modify(tp, 0xea1c, 0x0003, 0x0001); >>> - r8168_mac_ocp_modify(tp, 0xe0c0, 0x4f0f, 0x4403); >>> - r8168_mac_ocp_modify(tp, 0xe052, 0x0080, 0x0068); >>> - r8168_mac_ocp_modify(tp, 0xd430, 0x0fff, 0x047f); >>> - >>> - r8168_mac_ocp_modify(tp, 0xea1c, 0x0004, 0x0000); >>> - r8168_mac_ocp_modify(tp, 0xeb54, 0x0000, 0x0001); > > [1] I think we have a candidate for a squeeze here in this driver. > >>> udelay(1); >>> - r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); >>> - RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); >>> - r8168_mac_ocp_write(tp, 0xe098, 0xc302); >>> + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); >>> + __r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); >>> + RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); >>> + __r8168_mac_ocp_write(tp, 0xe098, 0xc302); >>> + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); >>> rtl_loop_wait_low(tp, &rtl_mac_ocp_e00e_cond, 1000, 10); >>> >> >> All this manual locking and unlocking makes the code harder >> to read and more error-prone. Maybe, as a rule of thumb: >> If you can replace a block with more than 10 mac ocp ops, >> then fine with me > As I worked with another German developer, Mr. Frank Heckenbach from the GNU Pascal project, > I know that Germans are pedantic and reliable :-) > > If this rtl_hw_start_8125_common() is called only once, then maybe every memory bus cycle > isn't worth saving, and then maybe the additional complexity isn't worth adding (but it > was fun doing, and it works with my NIC). > > AFAIK, a spin_lock_irqsave()/spin_unlock_irqrestore() isn't a free lunch as you know, and I read > from the manuals that on modern CPUs a locked ADD $0, -128(%esp) or something takes about 50 > clock cycles, in which all cores have to wait. > > Doing that in storm of 10 lock/unlock pairs amounts to 500 cycles or 125 ns in the best case > on a 4 GHz CPU. > > But I trust that you as the maintainer have the big picture and greater insight in the actual hw. > Big picture: maybe, as I've been working on r8169 for about 5 yrs Greater insight in the actual hw: not really, because Realtek doesn't publish datasheets and errata information. I just know the history of a lot of needed quirks in r8169, and I can recall what we tried to improve and had to roll back because users had problems. Most famous example is ASPM L1 + sub-state support. ASPM with r8169-driven NIC versions has a long history of users facing slow network connectivity / tx timeouts / complete NIC stalls. And it's still not solved with the newest NIC versions. But maybe not only Realtek is to blame. Basically every consumer mainboard comes with a Realtek NIC, chipset issues and BIOS bugs contribute to the mess. > Elon Musk said that the greatest error in engineering is optimising stuff that never should > have been written. > > As I like my RTL 8125b NIC to be fast (TM), I am not giving up on it. > Does this mean that you're not achieving the 2500Mbps line rate with r8169 as-is? > Just recalled that you mentioned that it supports RSS. Is it feasible to add it to the r8169 > driver? > Feasible: definitely, as you "just" have to take the RSS-related code from r8125 and adjust it to meet mainline code quality criteria Question is which actual benefit it brings, and whether it's worth the additional complexity. As a starting point you could compare performance KPI's when using r8169 vs. r8125. > It is OK for me that we have a formal and less formal mode of communication and switch between > them. > > Thank you for all your quick reviews, much obliged, thank you for partially accepting the patches, > but I think I need to sleep over it before submitting a new version. > > Regards, > Mirsad Todorovac
Hello Mirsad, [most CCs dropped] I'm responding to your comment quoted below. It caught eyes of me who happens to be a reviewer of LKMM and a LaTeX advisor to perfbook. On Mon, 30 Oct 2023 16:02:28 +0100, Mirsad Todorovac wrote: > On 10/30/23 15:02, Heiner Kallweit wrote: [...] >> >> All this manual locking and unlocking makes the code harder >> to read and more error-prone. Maybe, as a rule of thumb: >> If you can replace a block with more than 10 mac ocp ops, >> then fine with me > As I worked with another German developer, Mr. Frank Heckenbach from the GNU Pascal project, > I know that Germans are pedantic and reliable :-) > > If this rtl_hw_start_8125_common() is called only once, then maybe every memory bus cycle > isn't worth saving, and then maybe the additional complexity isn't worth adding (but it > was fun doing, and it works with my NIC). > > AFAIK, a spin_lock_irqsave()/spin_unlock_irqrestore() isn't a free lunch as you know, and I read > from the manuals that on modern CPUs a locked ADD $0, -128(%esp) or something takes about 50 > clock cycles, in which all cores have to wait. Do you mean, while one of x86 CPUs is executing "lock; addl $0, -4(%esp)" aka smp_mb(), bus locking prevents all the other CPUs in the system connected to the bus from doing any memory accesses ??? If it is the case, I believe you are missing the state of the art optimization of x86 memory system architecture, where most of atomic operations are done using cache locking. Bus locking is used only when it is unavoidable. Hint: A short introduction can be found at stackoverflow.com [1]. Quote of (then) section 7.1.4 from Intel's SDM vol 3A in the answer should suffice. [1]: https://stackoverflow.com/questions/3339141/x86-lock-question-on-multi-core-cpus A reachable link to Intel SDM should be somewhere in perfbook's bibliography. The relevant section in Vol 3A is now "2.8.5 Controlling the Processor". Hope this helps, Akira > > Doing that in storm of 10 lock/unlock pairs amounts to 500 cycles or 125 ns in the best case > on a 4 GHz CPU. > > But I trust that you as the maintainer have the big picture and greater insight in the actual hw.
Hello Akira, On 10/31/23 09:23, Akira Yokosawa wrote: > Hello Mirsad, > > [most CCs dropped] > > I'm responding to your comment quoted below. It caught eyes of me > who happens to be a reviewer of LKMM and a LaTeX advisor to perfbook. > > On Mon, 30 Oct 2023 16:02:28 +0100, Mirsad Todorovac wrote: >> On 10/30/23 15:02, Heiner Kallweit wrote: > [...] >>> >>> All this manual locking and unlocking makes the code harder >>> to read and more error-prone. Maybe, as a rule of thumb: >>> If you can replace a block with more than 10 mac ocp ops, >>> then fine with me >> As I worked with another German developer, Mr. Frank Heckenbach from the GNU Pascal project, >> I know that Germans are pedantic and reliable :-) >> >> If this rtl_hw_start_8125_common() is called only once, then maybe every memory bus cycle >> isn't worth saving, and then maybe the additional complexity isn't worth adding (but it >> was fun doing, and it works with my NIC). >> >> AFAIK, a spin_lock_irqsave()/spin_unlock_irqrestore() isn't a free lunch as you know, and I read >> from the manuals that on modern CPUs a locked ADD $0, -128(%esp) or something takes about 50 >> clock cycles, in which all cores have to wait. > > Do you mean, while one of x86 CPUs is executing "lock; addl $0, -4(%esp)" > aka smp_mb(), bus locking prevents all the other CPUs in the system > connected to the bus from doing any memory accesses ??? > > If it is the case, I believe you are missing the state of the art > optimization of x86 memory system architecture, where most of atomic > operations are done using cache locking. Bus locking is used only > when it is unavoidable. > > Hint: A short introduction can be found at stackoverflow.com [1]. > Quote of (then) section 7.1.4 from Intel's SDM vol 3A in the answer > should suffice. > > [1]: https://stackoverflow.com/questions/3339141/x86-lock-question-on-multi-core-cpus > > A reachable link to Intel SDM should be somewhere in perfbook's bibliography. > The relevant section in Vol 3A is now "2.8.5 Controlling the Processor". > > Hope this helps, > Akira It surely did help, thx for the ref. However, note this quote from the very perfbook: ============= Quick Quiz 7.19: p.110 How might the lock holder be interfered with? Answer: If the data protected by the lock is in the same cache line as the lock itself, then attempts by other CPUs to acquire the lock will result in expensive cache misses on the part of the CPU holding the lock. This is a special case of false sharing, which can also occur if a pair of variables protected by different locks happen to share a cache line. In contrast, if the lock is in a different cache line than the data that it protects, the CPU holding the lock will usually suffer a cache miss only on first access to a given variable. Of course, the downside of placing the lock and data into separate cache lines is that the code will incur two cache misses rather than only one in the uncontended case. As always, choose wisely! ============= In other words, in the contended case the lock data cannot be simultaneously in two core's cache lines, so the cache miss logic will have to do something expensive. Table E.1: Performance of Synchronization Mechanisms on 16-CPU 2.8 GHz Intel X5550 (Nehalem) System (page 480 of the perfbook) Operation Cost (ns) Ratio (cost/clock) -------------------------------------------- Clock period 0.4 1.0 -------------------------------------------- Same-CPU CAS 12.2 33.8 lock 25.6 71.2 -------------------------------------------- On-Core Blind CAS 12.9 35.8 CAS 7.0 19.4 -------------------------------------------- Off-Core Blind CAS 31.2 86.6 CAS 31.2 86.5 -------------------------------------------- Off-Socket Blind CAS 92.4 256.7 CAS 95.9 266.4 -------------------------------------------- Off-System Comms Fabric 2,600 7,220 Global Comms 195,000,000 542,000,000 Table E.2: CPU 0 View of Synchronization Mechanisms on 12-CPU Intel Core i7-8750H CPU @ 2.20 GHz Operation Cost (ns) Ratio (cost/clock) CPUs ---------------------------------------------------- Clock period 0.5 1.0 ---------------------------------------------------- Same-CPU 0 CAS 6.2 13.6 lock 13.5 29.6 ---------------------------------------------------- On-Core 6 Blind CAS 6.5 14.3 CAS 16.2 35.6 ---------------------------------------------------- Off-Core 1–5 Blind CAS 22.2 48.8 7–11 CAS 53.6 117.9 ---------------------------------------------------- Off-System Comms Fabric 5,000 11,000 Global Comms 195,000,000 429,000,000 So, on those systems, in the lock contended case and one threat acquiring and releasing spinlock 11 times like in the RTL8125 driver, the cost can mount up from 770 clock to 266 × 11 ~ 2660 + 266 = 2926 clock cycles. 95 × 11 ns = 950 + 95 ns = 1045 ns. (In the worst case, that two threads contend each time for acquiring the lock. The minimum cost is 25.6 × 11 = 256 + 25.6 ns = 281.6 ns for the uncontended case. For the Nehalem system. But the lock were added for the concurrent access in the first place. Besides, the Realtek logic might be confused in the rare case that the MMIO programming sequence is not sequential, but interrupted when the lock is released instead of adding MMIO instructions in uninterrupted bulk sequence. So, it is risky to release the lock before the sequence is finished.) So, my calculations below weren't exaggerated. Locking is faster if the lock is cached on the local core, but the downside is that we need them exactly to synchronise what is happening between the threads on two independent cores. >> Doing that in storm of 10 lock/unlock pairs amounts to 500 cycles or 125 ns in the best case >> on a 4 GHz CPU. >> >> But I trust that you as the maintainer have the big picture and greater insight in the actual hw. Hope this helps. Best regards, Mirsad Todorovac
diff --git a/drivers/net/ethernet/realtek/r8169_main.c b/drivers/net/ethernet/realtek/r8169_main.c index 50fbacb05953..0778cd0ba2e0 100644 --- a/drivers/net/ethernet/realtek/r8169_main.c +++ b/drivers/net/ethernet/realtek/r8169_main.c @@ -3553,6 +3553,28 @@ DECLARE_RTL_COND(rtl_mac_ocp_e00e_cond) static void rtl_hw_start_8125_common(struct rtl8169_private *tp) { + static const struct e_info_regmaskset e_info_8125_common_1[] = { + { 0xd3e2, 0x0fff, 0x03a9 }, + { 0xd3e4, 0x00ff, 0x0000 }, + { 0xe860, 0x0000, 0x0080 }, + }; + + static const struct e_info_regmaskset e_info_8125_common_2[] = { + { 0xc0b4, 0x0000, 0x000c }, + { 0xeb6a, 0x00ff, 0x0033 }, + { 0xeb50, 0x03e0, 0x0040 }, + { 0xe056, 0x00f0, 0x0030 }, + { 0xe040, 0x1000, 0x0000 }, + { 0xea1c, 0x0003, 0x0001 }, + { 0xe0c0, 0x4f0f, 0x4403 }, + { 0xe052, 0x0080, 0x0068 }, + { 0xd430, 0x0fff, 0x047f }, + { 0xea1c, 0x0004, 0x0000 }, + { 0xeb54, 0x0000, 0x0001 }, + }; + + unsigned long flags; + rtl_pcie_state_l2l3_disable(tp); RTL_W16(tp, 0x382, 0x221b); @@ -3560,47 +3582,38 @@ static void rtl_hw_start_8125_common(struct rtl8169_private *tp) RTL_W16(tp, 0x4800, 0); /* disable UPS */ - r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); + + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); + __r8168_mac_ocp_modify(tp, 0xd40a, 0x0010, 0x0000); RTL_W8(tp, Config1, RTL_R8(tp, Config1) & ~0x10); - r8168_mac_ocp_write(tp, 0xc140, 0xffff); - r8168_mac_ocp_write(tp, 0xc142, 0xffff); + __r8168_mac_ocp_write(tp, 0xc140, 0xffff); + __r8168_mac_ocp_write(tp, 0xc142, 0xffff); - r8168_mac_ocp_modify(tp, 0xd3e2, 0x0fff, 0x03a9); - r8168_mac_ocp_modify(tp, 0xd3e4, 0x00ff, 0x0000); - r8168_mac_ocp_modify(tp, 0xe860, 0x0000, 0x0080); + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_1); /* disable new tx descriptor format */ - r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); + __r8168_mac_ocp_modify(tp, 0xeb58, 0x0001, 0x0000); - if (tp->mac_version == RTL_GIGA_MAC_VER_63) - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); - else - r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); + if (tp->mac_version == RTL_GIGA_MAC_VER_63) { + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0200); + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); + } else { + __r8168_mac_ocp_modify(tp, 0xe614, 0x0700, 0x0400); + __r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); + } + + __r8168_mac_ocp_modify_seq(tp, e_info_8125_common_2); + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); - if (tp->mac_version == RTL_GIGA_MAC_VER_63) - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0000); - else - r8168_mac_ocp_modify(tp, 0xe63e, 0x0c30, 0x0020); - - r8168_mac_ocp_modify(tp, 0xc0b4, 0x0000, 0x000c); - r8168_mac_ocp_modify(tp, 0xeb6a, 0x00ff, 0x0033); - r8168_mac_ocp_modify(tp, 0xeb50, 0x03e0, 0x0040); - r8168_mac_ocp_modify(tp, 0xe056, 0x00f0, 0x0030); - r8168_mac_ocp_modify(tp, 0xe040, 0x1000, 0x0000); - r8168_mac_ocp_modify(tp, 0xea1c, 0x0003, 0x0001); - r8168_mac_ocp_modify(tp, 0xe0c0, 0x4f0f, 0x4403); - r8168_mac_ocp_modify(tp, 0xe052, 0x0080, 0x0068); - r8168_mac_ocp_modify(tp, 0xd430, 0x0fff, 0x047f); - - r8168_mac_ocp_modify(tp, 0xea1c, 0x0004, 0x0000); - r8168_mac_ocp_modify(tp, 0xeb54, 0x0000, 0x0001); udelay(1); - r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); - RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); - r8168_mac_ocp_write(tp, 0xe098, 0xc302); + raw_spin_lock_irqsave(&tp->mac_ocp_lock, flags); + __r8168_mac_ocp_modify(tp, 0xeb54, 0x0001, 0x0000); + RTL_W16(tp, 0x1880, RTL_R16(tp, 0x1880) & ~0x0030); + __r8168_mac_ocp_write(tp, 0xe098, 0xc302); + raw_spin_unlock_irqrestore(&tp->mac_ocp_lock, flags); rtl_loop_wait_low(tp, &rtl_mac_ocp_e00e_cond, 1000, 10);