Message ID | 20221201002719.2596558-4-ira.weiny@intel.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:adf:f944:0:0:0:0:0 with SMTP id q4csp1228081wrr; Wed, 30 Nov 2022 16:30:13 -0800 (PST) X-Google-Smtp-Source: AA0mqf4Z7LzoE19GYDk/nzo3dOdjEaHmGPrG3Zfejo1jrWvCGvBhto/agVMmRqgHuT/c8gSfLykO X-Received: by 2002:a17:906:fc9:b0:7ae:ef99:6fb2 with SMTP id c9-20020a1709060fc900b007aeef996fb2mr53181358ejk.761.1669854613679; Wed, 30 Nov 2022 16:30:13 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1669854613; cv=none; d=google.com; s=arc-20160816; b=ypOsf6tpZ028UbS3WUAuz4wQ1vUi4IdOPzHODXJFroZ+mGOX760gg84mDx79TZRxx1 Ek4yz8m/h75CJoA4//eyadvILdJZl7oCTjBOl3B9O4K1qiBg3OJ59eLW+UnktG2zEim/ QwTcxXU/GaUWFDe4JEGD0iGfxNN0dYPiqSTgujKThaGPnn3NbaT493510mARbm7cCHGb Fb51yLafIaoZKkedOmaLEKHP28CVOpw1euhkLd8vdmgcq28Sc2W2+AC/EUAqfAseM9K2 0dJOLZ9q6WbJk+GnnlFucorb4Ku/t8XhmNElkfq81AzulJOtq5N1ds8LfKupfz3MqcJs 9c+A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=mWX/KkWDQcEemrUJFqTcFdnpht3bk3APXVDugf2s6rs=; b=Lzeqe8cT6a/xcK5gQJxXBnKC89xJpbRt1bIb2+B10nKDalyFyamPAUE3PcI6G7QwHj sK03RQ9VeOLoi0feB63wfciJxEZymkFpGL6lyLqFgeHet5INZXxvLgyoVpKkdablBlJ9 RAKCWsz0JxjO4atOfdxWlrcfGePHCHPjgHOylnI3d61vMlOM9kNJoOPjyfjUCwqt1FVR yvqNBlejidvgdjVbQg/2pqSwrkOVovMvonCVRnAxjD9P4eST22vuxNb2FjWvW+wPfetr RbVlrbqBhIgaPz1bW3fOmhiTI35a8zmx2ETSCqkKiJrQABe37xbEOpIhDTWrOrdKqbAp td0g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b="R/EhuWUK"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id gs31-20020a1709072d1f00b0078e1d1d6005si2992731ejc.23.2022.11.30.16.29.49; Wed, 30 Nov 2022 16:30:13 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b="R/EhuWUK"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229759AbiLAA1k (ORCPT <rfc822;heyuhang3455@gmail.com> + 99 others); Wed, 30 Nov 2022 19:27:40 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45948 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229675AbiLAA1b (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Wed, 30 Nov 2022 19:27:31 -0500 Received: from mga03.intel.com (mga03.intel.com [134.134.136.65]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E3E925803E; Wed, 30 Nov 2022 16:27:30 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1669854450; x=1701390450; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=jXysWS4Ez9au+rW5fyav+5s0bOnnmJEkvkXUz0fzpXM=; b=R/EhuWUKEnSKvXQKocMczcL0A+n5+j2ZJgOyVaQT5fzMD5ZXsZL6M43E y0Xqo+AdHvWN76Yngz8v6llJqEv2a2EobBclQMRO+S9FZEvTqZ81rGm6y Bzl4y4QWHOqFjLelN4udWKOeNa4NUV9ckTAS14yvXewH/7ivFzrGJhSrH dQDJ8V/FNycw28P5/uEvOhcDydmIpHxpMPqGpqXD/wyRXBxAzR/KOuwvp WM1bkZ0zWT/dfeoLtGqCy+4Cwr7JHvpyeyIdEJ0CzoVwen7qSAGcHKYPb QUpHH+A0HGVm0LKjOxgqA25yRIF8KlGU9v9+h66Oe6aa2cvLIUxL4YyMC g==; X-IronPort-AV: E=McAfee;i="6500,9779,10547"; a="317400840" X-IronPort-AV: E=Sophos;i="5.96,207,1665471600"; d="scan'208";a="317400840" Received: from orsmga006.jf.intel.com ([10.7.209.51]) by orsmga103.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 30 Nov 2022 16:27:29 -0800 X-IronPort-AV: E=McAfee;i="6500,9779,10547"; a="622085218" X-IronPort-AV: E=Sophos;i="5.96,207,1665471600"; d="scan'208";a="622085218" Received: from iweiny-mobl.amr.corp.intel.com (HELO localhost) ([10.251.1.240]) by orsmga006-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 30 Nov 2022 16:27:28 -0800 From: ira.weiny@intel.com To: Dan Williams <dan.j.williams@intel.com> Cc: Ira Weiny <ira.weiny@intel.com>, Alison Schofield <alison.schofield@intel.com>, Vishal Verma <vishal.l.verma@intel.com>, Ben Widawsky <bwidawsk@kernel.org>, Steven Rostedt <rostedt@goodmis.org>, Jonathan Cameron <Jonathan.Cameron@huawei.com>, Davidlohr Bueso <dave@stgolabs.net>, Dave Jiang <dave.jiang@intel.com>, linux-kernel@vger.kernel.org, linux-cxl@vger.kernel.org Subject: [PATCH V2 03/11] cxl/mem: Implement Clear Event Records command Date: Wed, 30 Nov 2022 16:27:11 -0800 Message-Id: <20221201002719.2596558-4-ira.weiny@intel.com> X-Mailer: git-send-email 2.37.2 In-Reply-To: <20221201002719.2596558-1-ira.weiny@intel.com> References: <20221201002719.2596558-1-ira.weiny@intel.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1750969471620809558?= X-GMAIL-MSGID: =?utf-8?q?1750969471620809558?= |
Series |
CXL: Process event logs
|
|
Commit Message
Ira Weiny
Dec. 1, 2022, 12:27 a.m. UTC
From: Ira Weiny <ira.weiny@intel.com> CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox command. After an event record is read it needs to be cleared from the event log. Implement cxl_clear_event_record() to clear all record retrieved from the device. Each record is cleared explicitly. A clear all bit is specified but events could arrive between a get and any final clear all operation. This means events would be missed. Therefore each event is cleared specifically. Signed-off-by: Ira Weiny <ira.weiny@intel.com> --- Changes from V1: Clear Event Record allows for u8 handles while Get Event Record allows for u16 records to be returned. Based on Jonathan's feedback; allow for all event records to be handled in this clear. Which means a double loop with potentially multiple Clear Event payloads being sent to clear all events sent. Changes from RFC: Jonathan Clean up init of payload and use return code. Also report any error to clear the event. s/v3.0/rev 3.0 --- drivers/cxl/core/mbox.c | 61 +++++++++++++++++++++++++++++++----- drivers/cxl/cxlmem.h | 14 +++++++++ include/uapi/linux/cxl_mem.h | 1 + 3 files changed, 69 insertions(+), 7 deletions(-)
Comments
On Wed, 30 Nov 2022 16:27:11 -0800 ira.weiny@intel.com wrote: > From: Ira Weiny <ira.weiny@intel.com> > > CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox > command. After an event record is read it needs to be cleared from the > event log. > > Implement cxl_clear_event_record() to clear all record retrieved from > the device. > > Each record is cleared explicitly. A clear all bit is specified but > events could arrive between a get and any final clear all operation. > This means events would be missed. > Therefore each event is cleared specifically. > > Signed-off-by: Ira Weiny <ira.weiny@intel.com> I think there is a type issue on the min_t() calculation with that addressed this looks good to me. Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> > > --- > Changes from V1: > Clear Event Record allows for u8 handles while Get Event Record > allows for u16 records to be returned. Based on Jonathan's > feedback; allow for all event records to be handled in this > clear. Which means a double loop with potentially multiple > Clear Event payloads being sent to clear all events sent. > > Changes from RFC: > Jonathan > Clean up init of payload and use return code. > Also report any error to clear the event. > s/v3.0/rev 3.0 > --- > drivers/cxl/core/mbox.c | 61 +++++++++++++++++++++++++++++++----- > drivers/cxl/cxlmem.h | 14 +++++++++ > include/uapi/linux/cxl_mem.h | 1 + > 3 files changed, 69 insertions(+), 7 deletions(-) > > diff --git a/drivers/cxl/core/mbox.c b/drivers/cxl/core/mbox.c > index 70b681027a3d..076a3df0ba38 100644 > --- a/drivers/cxl/core/mbox.c > +++ b/drivers/cxl/core/mbox.c > @@ -52,6 +52,7 @@ static struct cxl_mem_command cxl_mem_commands[CXL_MEM_COMMAND_ID_MAX] = { > #endif > CXL_CMD(GET_SUPPORTED_LOGS, 0, CXL_VARIABLE_PAYLOAD, CXL_CMD_FLAG_FORCE_ENABLE), > CXL_CMD(GET_EVENT_RECORD, 1, CXL_VARIABLE_PAYLOAD, 0), > + CXL_CMD(CLEAR_EVENT_RECORD, CXL_VARIABLE_PAYLOAD, 0, 0), > CXL_CMD(GET_FW_INFO, 0, 0x50, 0), > CXL_CMD(GET_PARTITION_INFO, 0, 0x20, 0), > CXL_CMD(GET_LSA, 0x8, CXL_VARIABLE_PAYLOAD, 0), > @@ -708,6 +709,42 @@ int cxl_enumerate_cmds(struct cxl_dev_state *cxlds) > } > EXPORT_SYMBOL_NS_GPL(cxl_enumerate_cmds, CXL); > > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > + enum cxl_event_log_type log, > + struct cxl_get_event_payload *get_pl, > + u16 total) > +{ > + struct cxl_mbox_clear_event_payload payload = { > + .event_log = log, > + }; > + int cnt; > + > + /* > + * Clear Event Records uses u8 for the handle cnt while Get Event > + * Record can return up to 0xffff records. > + */ > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > + u8 nr_recs = min_t(u8, (total - cnt), > + CXL_CLEAR_EVENT_MAX_HANDLES); I might be half asleep but isn't this assuming that (total - cnt) fits in an u8? Shouldn't this be min_t(u16, ..) Also, maybe u16 cnt would be simpler. Hmm. This is safe but only because of how you call it alongside handling of a particular Get event records response (which must have fitted in the mailbox and has a longer header). Looking at this function in isolation, I think the mailbox could be small enough that we might not fit 255 records + the header. Perhaps we need a comment to say that, or at minimum a check and error return if it won't fit? > + int i, rc; > + > + for (i = 0; i < nr_recs; i++, cnt++) { > + payload.handle[i] = get_pl->records[cnt].hdr.handle; > + dev_dbg(cxlds->dev, "Event log '%s': Clearning %u\n", > + cxl_event_log_type_str(log), > + le16_to_cpu(payload.handle[i])); > + } > + payload.nr_recs = nr_recs; > + > + rc = cxl_mbox_send_cmd(cxlds, CXL_MBOX_OP_CLEAR_EVENT_RECORD, > + &payload, sizeof(payload), NULL, 0); > + if (rc) > + return rc; > + } > + > + return 0; > +} > + > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > enum cxl_event_log_type type) > { > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, This feels miss named now but I can't immediately think of better naming so on that basis fine to leave it as is if you don't have a better idea!. > } > > nr_rec = le16_to_cpu(payload->record_count); > - if (trace_cxl_generic_event_enabled()) { > + if (nr_rec > 0) { > int i; > > - for (i = 0; i < nr_rec; i++) > - trace_cxl_generic_event(dev_name(cxlds->dev), > - type, > - &payload->records[i]); > + if (trace_cxl_generic_event_enabled()) { > + for (i = 0; i < nr_rec; i++) > + trace_cxl_generic_event(dev_name(cxlds->dev), > + type, > + &payload->records[i]); > + } > + > + rc = cxl_clear_event_record(cxlds, type, payload, nr_rec); > + if (rc) { > + dev_err(cxlds->dev, "Event log '%s': Failed to clear events : %d", > + cxl_event_log_type_str(type), rc); > + return; > + } > }
On Thu, Dec 01, 2022 at 01:26:18PM +0000, Jonathan Cameron wrote: > On Wed, 30 Nov 2022 16:27:11 -0800 > ira.weiny@intel.com wrote: > > > From: Ira Weiny <ira.weiny@intel.com> > > > > CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox > > command. After an event record is read it needs to be cleared from the > > event log. > > > > Implement cxl_clear_event_record() to clear all record retrieved from > > the device. > > > > Each record is cleared explicitly. A clear all bit is specified but > > events could arrive between a get and any final clear all operation. > > This means events would be missed. > > Therefore each event is cleared specifically. > > > > Signed-off-by: Ira Weiny <ira.weiny@intel.com> > I think there is a type issue on the min_t() calculation with that addressed > this looks good to me. > > Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> > > > > > --- > > Changes from V1: > > Clear Event Record allows for u8 handles while Get Event Record > > allows for u16 records to be returned. Based on Jonathan's > > feedback; allow for all event records to be handled in this > > clear. Which means a double loop with potentially multiple > > Clear Event payloads being sent to clear all events sent. > > > > Changes from RFC: > > Jonathan > > Clean up init of payload and use return code. > > Also report any error to clear the event. > > s/v3.0/rev 3.0 > > --- > > drivers/cxl/core/mbox.c | 61 +++++++++++++++++++++++++++++++----- > > drivers/cxl/cxlmem.h | 14 +++++++++ > > include/uapi/linux/cxl_mem.h | 1 + > > 3 files changed, 69 insertions(+), 7 deletions(-) > > > > diff --git a/drivers/cxl/core/mbox.c b/drivers/cxl/core/mbox.c > > index 70b681027a3d..076a3df0ba38 100644 > > --- a/drivers/cxl/core/mbox.c > > +++ b/drivers/cxl/core/mbox.c > > @@ -52,6 +52,7 @@ static struct cxl_mem_command cxl_mem_commands[CXL_MEM_COMMAND_ID_MAX] = { > > #endif > > CXL_CMD(GET_SUPPORTED_LOGS, 0, CXL_VARIABLE_PAYLOAD, CXL_CMD_FLAG_FORCE_ENABLE), > > CXL_CMD(GET_EVENT_RECORD, 1, CXL_VARIABLE_PAYLOAD, 0), > > + CXL_CMD(CLEAR_EVENT_RECORD, CXL_VARIABLE_PAYLOAD, 0, 0), > > CXL_CMD(GET_FW_INFO, 0, 0x50, 0), > > CXL_CMD(GET_PARTITION_INFO, 0, 0x20, 0), > > CXL_CMD(GET_LSA, 0x8, CXL_VARIABLE_PAYLOAD, 0), > > @@ -708,6 +709,42 @@ int cxl_enumerate_cmds(struct cxl_dev_state *cxlds) > > } > > EXPORT_SYMBOL_NS_GPL(cxl_enumerate_cmds, CXL); > > > > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > > + enum cxl_event_log_type log, > > + struct cxl_get_event_payload *get_pl, > > + u16 total) > > +{ > > + struct cxl_mbox_clear_event_payload payload = { > > + .event_log = log, > > + }; > > + int cnt; > > + > > + /* > > + * Clear Event Records uses u8 for the handle cnt while Get Event > > + * Record can return up to 0xffff records. > > + */ > > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > > + u8 nr_recs = min_t(u8, (total - cnt), > > + CXL_CLEAR_EVENT_MAX_HANDLES); > > I might be half asleep but isn't this assuming that (total - cnt) > fits in an u8? Shouldn't this be min_t(u16, ..) This cast will ensure the value is never out of range for nr_recs which needs to be u8 and (total - cnt) will never be negative. But now you have me double thinking myself. > Also, maybe u16 cnt would be simpler. > > Hmm. This is safe but only because of how you call it alongside > handling of a particular Get event records response (which must > have fitted in the mailbox and has a longer header). > > Looking at this function in isolation, I think the mailbox could be > small enough that we might not fit 255 records + the header. > Perhaps we need a comment to say that, or at minimum a check and error > return if it won't fit? I did not realize that Payload Size applied to input payloads as well. :-/ There is no check in the send command for that ATM. Looking at the spec I think you are right. I'll further limit the payload size here too. And with this I might get rid of the min_t() and just cap based on that value. > > > + int i, rc; > > + > > + for (i = 0; i < nr_recs; i++, cnt++) { > > + payload.handle[i] = get_pl->records[cnt].hdr.handle; > > + dev_dbg(cxlds->dev, "Event log '%s': Clearning %u\n", > > + cxl_event_log_type_str(log), > > + le16_to_cpu(payload.handle[i])); > > + } > > + payload.nr_recs = nr_recs; > > + > > + rc = cxl_mbox_send_cmd(cxlds, CXL_MBOX_OP_CLEAR_EVENT_RECORD, > > + &payload, sizeof(payload), NULL, 0); > > + if (rc) > > + return rc; > > + } > > + > > + return 0; > > +} > > + > > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > enum cxl_event_log_type type) > > { > > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > This feels miss named now but I can't immediately think of better naming so on that > basis fine to leave it as is if you don't have a better idea!. So we leave it. Naming is hard! :-D Thanks for the quick review, V3 coming ASAP. Ira
ira.weiny@ wrote: > From: Ira Weiny <ira.weiny@intel.com> > > CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox > command. After an event record is read it needs to be cleared from the > event log. > > Implement cxl_clear_event_record() to clear all record retrieved from > the device. > > Each record is cleared explicitly. A clear all bit is specified but > events could arrive between a get and any final clear all operation. > This means events would be missed. > Therefore each event is cleared specifically. Note that the spec has a better reason for why Clear All has limited usage: "Clear All Events is only allowed when the Event Log has overflowed; otherwise, the device shall return Invalid Input." Will need to wait and see if we need that to keep pace with a device with a high event frequency. > > Signed-off-by: Ira Weiny <ira.weiny@intel.com> > > --- > Changes from V1: > Clear Event Record allows for u8 handles while Get Event Record > allows for u16 records to be returned. Based on Jonathan's > feedback; allow for all event records to be handled in this > clear. Which means a double loop with potentially multiple > Clear Event payloads being sent to clear all events sent. > > Changes from RFC: > Jonathan > Clean up init of payload and use return code. > Also report any error to clear the event. > s/v3.0/rev 3.0 > --- > drivers/cxl/core/mbox.c | 61 +++++++++++++++++++++++++++++++----- > drivers/cxl/cxlmem.h | 14 +++++++++ > include/uapi/linux/cxl_mem.h | 1 + > 3 files changed, 69 insertions(+), 7 deletions(-) > > diff --git a/drivers/cxl/core/mbox.c b/drivers/cxl/core/mbox.c > index 70b681027a3d..076a3df0ba38 100644 > --- a/drivers/cxl/core/mbox.c > +++ b/drivers/cxl/core/mbox.c > @@ -52,6 +52,7 @@ static struct cxl_mem_command cxl_mem_commands[CXL_MEM_COMMAND_ID_MAX] = { > #endif > CXL_CMD(GET_SUPPORTED_LOGS, 0, CXL_VARIABLE_PAYLOAD, CXL_CMD_FLAG_FORCE_ENABLE), > CXL_CMD(GET_EVENT_RECORD, 1, CXL_VARIABLE_PAYLOAD, 0), > + CXL_CMD(CLEAR_EVENT_RECORD, CXL_VARIABLE_PAYLOAD, 0, 0), > CXL_CMD(GET_FW_INFO, 0, 0x50, 0), > CXL_CMD(GET_PARTITION_INFO, 0, 0x20, 0), > CXL_CMD(GET_LSA, 0x8, CXL_VARIABLE_PAYLOAD, 0), > @@ -708,6 +709,42 @@ int cxl_enumerate_cmds(struct cxl_dev_state *cxlds) > } > EXPORT_SYMBOL_NS_GPL(cxl_enumerate_cmds, CXL); > > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > + enum cxl_event_log_type log, > + struct cxl_get_event_payload *get_pl, > + u16 total) > +{ > + struct cxl_mbox_clear_event_payload payload = { > + .event_log = log, > + }; > + int cnt; > + > + /* > + * Clear Event Records uses u8 for the handle cnt while Get Event > + * Record can return up to 0xffff records. > + */ > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > + u8 nr_recs = min_t(u8, (total - cnt), > + CXL_CLEAR_EVENT_MAX_HANDLES); This seems overly complicated. @total is a duplicate of @get_pl->record_count, and the 2 loops feel like it could be cut down to one. > + int i, rc; > + > + for (i = 0; i < nr_recs; i++, cnt++) { > + payload.handle[i] = get_pl->records[cnt].hdr.handle; > + dev_dbg(cxlds->dev, "Event log '%s': Clearning %u\n", While I do think this operation is a mix of clearing and cleaning event records, I don't think "Clearning" is a word. > + cxl_event_log_type_str(log), > + le16_to_cpu(payload.handle[i])); > + } > + payload.nr_recs = nr_recs; > + > + rc = cxl_mbox_send_cmd(cxlds, CXL_MBOX_OP_CLEAR_EVENT_RECORD, > + &payload, sizeof(payload), NULL, 0); > + if (rc) > + return rc; > + } > + > + return 0; > +} > + > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > enum cxl_event_log_type type) > { > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > } > > nr_rec = le16_to_cpu(payload->record_count); > - if (trace_cxl_generic_event_enabled()) { > + if (nr_rec > 0) { > int i; > > - for (i = 0; i < nr_rec; i++) > - trace_cxl_generic_event(dev_name(cxlds->dev), > - type, > - &payload->records[i]); > + if (trace_cxl_generic_event_enabled()) { Again, trace_cxl_generic_event_enabled() injects some awkward formatting here to micro-optimize looping. Any performance benefit this code might offer is likely offset by the extra human effort to read it. > + for (i = 0; i < nr_rec; i++) > + trace_cxl_generic_event(dev_name(cxlds->dev), > + type, > + &payload->records[i]); > + } > + > + rc = cxl_clear_event_record(cxlds, type, payload, nr_rec); > + if (rc) { > + dev_err(cxlds->dev, "Event log '%s': Failed to clear events : %d", > + cxl_event_log_type_str(type), rc); > + return; > + } > } > > if (trace_cxl_overflow_enabled() && > @@ -780,10 +826,11 @@ static struct cxl_get_event_payload *alloc_event_buf(struct cxl_dev_state *cxlds > * cxl_mem_get_event_records - Get Event Records from the device > * @cxlds: The device data for the operation > * > - * Retrieve all event records available on the device and report them as trace > - * events. > + * Retrieve all event records available on the device, report them as trace > + * events, and clear them. > * > * See CXL rev 3.0 @8.2.9.2.2 Get Event Records > + * See CXL rev 3.0 @8.2.9.2.3 Clear Event Records > */ > void cxl_mem_get_event_records(struct cxl_dev_state *cxlds) > { > diff --git a/drivers/cxl/cxlmem.h b/drivers/cxl/cxlmem.h > index 55d57f5a64bc..1ae9962c5a06 100644 > --- a/drivers/cxl/cxlmem.h > +++ b/drivers/cxl/cxlmem.h > @@ -261,6 +261,7 @@ enum cxl_opcode { > CXL_MBOX_OP_INVALID = 0x0000, > CXL_MBOX_OP_RAW = CXL_MBOX_OP_INVALID, > CXL_MBOX_OP_GET_EVENT_RECORD = 0x0100, > + CXL_MBOX_OP_CLEAR_EVENT_RECORD = 0x0101, > CXL_MBOX_OP_GET_FW_INFO = 0x0200, > CXL_MBOX_OP_ACTIVATE_FW = 0x0202, > CXL_MBOX_OP_GET_SUPPORTED_LOGS = 0x0400, > @@ -396,6 +397,19 @@ static inline const char *cxl_event_log_type_str(enum cxl_event_log_type type) > return "<unknown>"; > } > > +/* > + * Clear Event Records input payload > + * CXL rev 3.0 section 8.2.9.2.3; Table 8-51 > + */ > +#define CXL_CLEAR_EVENT_MAX_HANDLES (0xff) > +struct cxl_mbox_clear_event_payload { > + u8 event_log; /* enum cxl_event_log_type */ > + u8 clear_flags; > + u8 nr_recs; > + u8 reserved[3]; > + __le16 handle[CXL_CLEAR_EVENT_MAX_HANDLES]; > +}; > + > struct cxl_mbox_get_partition_info { > __le64 active_volatile_cap; > __le64 active_persistent_cap; > diff --git a/include/uapi/linux/cxl_mem.h b/include/uapi/linux/cxl_mem.h > index 70459be5bdd4..7c1ad8062792 100644 > --- a/include/uapi/linux/cxl_mem.h > +++ b/include/uapi/linux/cxl_mem.h > @@ -25,6 +25,7 @@ > ___C(RAW, "Raw device command"), \ > ___C(GET_SUPPORTED_LOGS, "Get Supported Logs"), \ > ___C(GET_EVENT_RECORD, "Get Event Record"), \ > + ___C(CLEAR_EVENT_RECORD, "Clear Event Record"), \ > ___C(GET_FW_INFO, "Get FW Info"), \ > ___C(GET_PARTITION_INFO, "Get Partition Information"), \ > ___C(GET_LSA, "Get Label Storage Area"), \ Same, "yikes" / "must be at the end of the enum" feedback.
> > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > > + enum cxl_event_log_type log, > > + struct cxl_get_event_payload *get_pl, > > + u16 total) > > +{ > > + struct cxl_mbox_clear_event_payload payload = { > > + .event_log = log, > > + }; > > + int cnt; > > + > > + /* > > + * Clear Event Records uses u8 for the handle cnt while Get Event > > + * Record can return up to 0xffff records. > > + */ > > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > > + u8 nr_recs = min_t(u8, (total - cnt), > > + CXL_CLEAR_EVENT_MAX_HANDLES); > > This seems overly complicated. @total is a duplicate of > @get_pl->record_count, and the 2 loops feel like it could be cut > down to one. You could do something nasty like for (i = 0; i < total; i++) { ... payload.handle[i % CLEAR_EVENT_MAX_HANDLES] = ... if (i % CXL_CLEAR_EVENT_MAX_HANDLES == CXL_CLEAR_EVENT_MAX_HANDLE - 1) { send command. } } but that looks worse to me than the double loop. Making outer loop for (j = 0; j <= total / CXL_CLEAR_EVENT_MAX_HANDLES; j++) might bet clearer but then you'd have to do records[j * CXL_CLEAR_EVENT_MAX_HANDLES + i] which isn't nice. Ah well, Ira gets to try and find a happy compromise. ... > > diff --git a/include/uapi/linux/cxl_mem.h b/include/uapi/linux/cxl_mem.h > > index 70459be5bdd4..7c1ad8062792 100644 > > --- a/include/uapi/linux/cxl_mem.h > > +++ b/include/uapi/linux/cxl_mem.h > > @@ -25,6 +25,7 @@ > > ___C(RAW, "Raw device command"), \ > > ___C(GET_SUPPORTED_LOGS, "Get Supported Logs"), \ > > ___C(GET_EVENT_RECORD, "Get Event Record"), \ > > + ___C(CLEAR_EVENT_RECORD, "Clear Event Record"), \ > > ___C(GET_FW_INFO, "Get FW Info"), \ > > ___C(GET_PARTITION_INFO, "Get Partition Information"), \ > > ___C(GET_LSA, "Get Label Storage Area"), \ > > Same, "yikes" / "must be at the end of the enum" feedback. Macro magic makes that non obvious.. Not that I'd ever said I thought this trick was a bad idea ;)
On Thu, 1 Dec 2022 18:29:20 -0800 Dan Williams <dan.j.williams@intel.com> wrote: > > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > enum cxl_event_log_type type) > > { > > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > } > > > > nr_rec = le16_to_cpu(payload->record_count); > > - if (trace_cxl_generic_event_enabled()) { > > + if (nr_rec > 0) { > > int i; > > > > - for (i = 0; i < nr_rec; i++) > > - trace_cxl_generic_event(dev_name(cxlds->dev), > > - type, > > - &payload->records[i]); > > + if (trace_cxl_generic_event_enabled()) { > > Again, trace_cxl_generic_event_enabled() injects some awkward > formatting here to micro-optimize looping. Any performance benefit this > code might offer is likely offset by the extra human effort to read it. This is commonly used throughout the kernel, and highly suggested for use to encapsulate any work being done only for tracing, when tracing is disabled. It uses static_braches/jump_labels which makes the loop into a 'nop' when tracing is off. That is, there is zero overhead for the for loop below (and there's not even a branch to skip it!) But sure, if you really don't care as it's not a fast path, then keep it out. I like people to keep the habit of doing this, because otherwise it tends to creep into the fast paths. -- Steve > > > + for (i = 0; i < nr_rec; i++) > > + trace_cxl_generic_event(dev_name(cxlds->dev), > > + type, > > + &payload->records[i]); > > + } > > + > > + rc = cxl_clear_event_record(cxlds, type, payload, nr_rec); > > + if (rc) { > > + dev_err(cxlds->dev, "Event log '%s': Failed to clear events : %d", > > + cxl_event_log_type_str(type), rc); > > + return; > > + } > > } > >
Steven Rostedt wrote: > On Thu, 1 Dec 2022 18:29:20 -0800 > Dan Williams <dan.j.williams@intel.com> wrote: > > > > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > > enum cxl_event_log_type type) > > > { > > > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > > } > > > > > > nr_rec = le16_to_cpu(payload->record_count); > > > - if (trace_cxl_generic_event_enabled()) { > > > + if (nr_rec > 0) { > > > int i; > > > > > > - for (i = 0; i < nr_rec; i++) > > > - trace_cxl_generic_event(dev_name(cxlds->dev), > > > - type, > > > - &payload->records[i]); > > > + if (trace_cxl_generic_event_enabled()) { > > > > Again, trace_cxl_generic_event_enabled() injects some awkward > > formatting here to micro-optimize looping. Any performance benefit this > > code might offer is likely offset by the extra human effort to read it. > > This is commonly used throughout the kernel, and highly suggested for use to > encapsulate any work being done only for tracing, when tracing is disabled. > It uses static_braches/jump_labels which makes the loop into a 'nop' when > tracing is off. That is, there is zero overhead for the for loop below (and > there's not even a branch to skip it!) > > But sure, if you really don't care as it's not a fast path, then keep it > out. I like people to keep the habit of doing this, because otherwise it > tends to creep into the fast paths. Duly noted. It makes a lot of sense when you are tracing in a fast path to skip any and all preamble code. In this case we are doing it after doing a whole series of uncached PCI mmio reads with all the stalling and serialization that implies. Speaking of which, this probably wants a cond_resched() after each loop iteration. I'll note it is also a tracepoint that is likely to be enabled most of the time in production.
On Fri, Dec 02, 2022 at 11:27:07AM -0800, Dan Williams wrote: > Steven Rostedt wrote: > > On Thu, 1 Dec 2022 18:29:20 -0800 > > Dan Williams <dan.j.williams@intel.com> wrote: > > > > > > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > > > enum cxl_event_log_type type) > > > > { > > > > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > > > } > > > > > > > > nr_rec = le16_to_cpu(payload->record_count); > > > > - if (trace_cxl_generic_event_enabled()) { > > > > + if (nr_rec > 0) { > > > > int i; > > > > > > > > - for (i = 0; i < nr_rec; i++) > > > > - trace_cxl_generic_event(dev_name(cxlds->dev), > > > > - type, > > > > - &payload->records[i]); > > > > + if (trace_cxl_generic_event_enabled()) { > > > > > > Again, trace_cxl_generic_event_enabled() injects some awkward > > > formatting here to micro-optimize looping. Any performance benefit this > > > code might offer is likely offset by the extra human effort to read it. > > > > This is commonly used throughout the kernel, and highly suggested for use to > > encapsulate any work being done only for tracing, when tracing is disabled. > > It uses static_braches/jump_labels which makes the loop into a 'nop' when > > tracing is off. That is, there is zero overhead for the for loop below (and > > there's not even a branch to skip it!) > > > > But sure, if you really don't care as it's not a fast path, then keep it > > out. I like people to keep the habit of doing this, because otherwise it > > tends to creep into the fast paths. Thanks for chiming in here Steven. I should have pushed back on this. > > Duly noted. It makes a lot of sense when you are tracing in a fast path > to skip any and all preamble code. In this case we are doing it after > doing a whole series of uncached PCI mmio reads with all the stalling > and serialization that implies. > > Speaking of which, this probably wants a cond_resched() after each loop > iteration. > > I'll note it is also a tracepoint that is likely to be enabled most of > the time in production. Ok I did not have any of these in there originally and I will remove them now. Thanks! Ira
On Thu, Dec 01, 2022 at 06:29:20PM -0800, Dan Williams wrote: > ira.weiny@ wrote: > > From: Ira Weiny <ira.weiny@intel.com> > > > > CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox > > command. After an event record is read it needs to be cleared from the > > event log. > > > > Implement cxl_clear_event_record() to clear all record retrieved from > > the device. > > > > Each record is cleared explicitly. A clear all bit is specified but > > events could arrive between a get and any final clear all operation. > > This means events would be missed. > > Therefore each event is cleared specifically. > > Note that the spec has a better reason for why Clear All has limited > usage: > > "Clear All Events is only allowed when the Event Log has overflowed; > otherwise, the device shall return Invalid Input." > > Will need to wait and see if we need that to keep pace with a device > with a high event frequency. Perhaps. But yea I would wait and see. [snip] > > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > > + enum cxl_event_log_type log, > > + struct cxl_get_event_payload *get_pl, > > + u16 total) > > +{ > > + struct cxl_mbox_clear_event_payload payload = { > > + .event_log = log, > > + }; > > + int cnt; > > + > > + /* > > + * Clear Event Records uses u8 for the handle cnt while Get Event > > + * Record can return up to 0xffff records. > > + */ > > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > > + u8 nr_recs = min_t(u8, (total - cnt), > > + CXL_CLEAR_EVENT_MAX_HANDLES); > > This seems overly complicated. @total is a duplicate of > @get_pl->record_count, and the 2 loops feel like it could be cut > down to one. Sure, total is redundant to pass to the function. However, 2 loops is IMO not at all overly complicated. Note that the 2 loops do not do the same thing. The inner loop is filling in the payload for the Clear command. There is really no way around doing this. Now that I've had time to think about it: Are you suggesting we issue a single mailbox command for every handle? That would be a single loop. But a lot more mailbox commands. > > > + int i, rc; > > + > > + for (i = 0; i < nr_recs; i++, cnt++) { > > + payload.handle[i] = get_pl->records[cnt].hdr.handle; > > + dev_dbg(cxlds->dev, "Event log '%s': Clearning %u\n", > > While I do think this operation is a mix of clearing and cleaning event > records, I don't think "Clearning" is a word. LOL... I'll fix it. :-D > > > + cxl_event_log_type_str(log), > > + le16_to_cpu(payload.handle[i])); > > + } > > + payload.nr_recs = nr_recs; > > + > > + rc = cxl_mbox_send_cmd(cxlds, CXL_MBOX_OP_CLEAR_EVENT_RECORD, > > + &payload, sizeof(payload), NULL, 0); > > + if (rc) > > + return rc; > > + } > > + > > + return 0; > > +} > > + > > static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > enum cxl_event_log_type type) > > { > > @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, > > } > > > > nr_rec = le16_to_cpu(payload->record_count); > > - if (trace_cxl_generic_event_enabled()) { > > + if (nr_rec > 0) { > > int i; > > > > - for (i = 0; i < nr_rec; i++) > > - trace_cxl_generic_event(dev_name(cxlds->dev), > > - type, > > - &payload->records[i]); > > + if (trace_cxl_generic_event_enabled()) { > > Again, trace_cxl_generic_event_enabled() injects some awkward > formatting here to micro-optimize looping. Any performance benefit this > code might offer is likely offset by the extra human effort to read it. Agreed. Gone. > > > + for (i = 0; i < nr_rec; i++) > > + trace_cxl_generic_event(dev_name(cxlds->dev), > > + type, > > + &payload->records[i]); > > + } > > + > > + rc = cxl_clear_event_record(cxlds, type, payload, nr_rec); > > + if (rc) { > > + dev_err(cxlds->dev, "Event log '%s': Failed to clear events : %d", > > + cxl_event_log_type_str(type), rc); > > + return; > > + } > > } > > > > if (trace_cxl_overflow_enabled() && > > @@ -780,10 +826,11 @@ static struct cxl_get_event_payload *alloc_event_buf(struct cxl_dev_state *cxlds > > * cxl_mem_get_event_records - Get Event Records from the device > > * @cxlds: The device data for the operation > > * > > - * Retrieve all event records available on the device and report them as trace > > - * events. > > + * Retrieve all event records available on the device, report them as trace > > + * events, and clear them. > > * > > * See CXL rev 3.0 @8.2.9.2.2 Get Event Records > > + * See CXL rev 3.0 @8.2.9.2.3 Clear Event Records > > */ > > void cxl_mem_get_event_records(struct cxl_dev_state *cxlds) > > { > > diff --git a/drivers/cxl/cxlmem.h b/drivers/cxl/cxlmem.h > > index 55d57f5a64bc..1ae9962c5a06 100644 > > --- a/drivers/cxl/cxlmem.h > > +++ b/drivers/cxl/cxlmem.h > > @@ -261,6 +261,7 @@ enum cxl_opcode { > > CXL_MBOX_OP_INVALID = 0x0000, > > CXL_MBOX_OP_RAW = CXL_MBOX_OP_INVALID, > > CXL_MBOX_OP_GET_EVENT_RECORD = 0x0100, > > + CXL_MBOX_OP_CLEAR_EVENT_RECORD = 0x0101, > > CXL_MBOX_OP_GET_FW_INFO = 0x0200, > > CXL_MBOX_OP_ACTIVATE_FW = 0x0202, > > CXL_MBOX_OP_GET_SUPPORTED_LOGS = 0x0400, > > @@ -396,6 +397,19 @@ static inline const char *cxl_event_log_type_str(enum cxl_event_log_type type) > > return "<unknown>"; > > } > > > > +/* > > + * Clear Event Records input payload > > + * CXL rev 3.0 section 8.2.9.2.3; Table 8-51 > > + */ > > +#define CXL_CLEAR_EVENT_MAX_HANDLES (0xff) > > +struct cxl_mbox_clear_event_payload { > > + u8 event_log; /* enum cxl_event_log_type */ > > + u8 clear_flags; > > + u8 nr_recs; > > + u8 reserved[3]; > > + __le16 handle[CXL_CLEAR_EVENT_MAX_HANDLES]; > > +}; > > + > > struct cxl_mbox_get_partition_info { > > __le64 active_volatile_cap; > > __le64 active_persistent_cap; > > diff --git a/include/uapi/linux/cxl_mem.h b/include/uapi/linux/cxl_mem.h > > index 70459be5bdd4..7c1ad8062792 100644 > > --- a/include/uapi/linux/cxl_mem.h > > +++ b/include/uapi/linux/cxl_mem.h > > @@ -25,6 +25,7 @@ > > ___C(RAW, "Raw device command"), \ > > ___C(GET_SUPPORTED_LOGS, "Get Supported Logs"), \ > > ___C(GET_EVENT_RECORD, "Get Event Record"), \ > > + ___C(CLEAR_EVENT_RECORD, "Clear Event Record"), \ > > ___C(GET_FW_INFO, "Get FW Info"), \ > > ___C(GET_PARTITION_INFO, "Get Partition Information"), \ > > ___C(GET_LSA, "Get Label Storage Area"), \ > > Same, "yikes" / "must be at the end of the enum" feedback. Yep, Ira
Ira Weiny wrote: > On Thu, Dec 01, 2022 at 06:29:20PM -0800, Dan Williams wrote: > > ira.weiny@ wrote: > > > From: Ira Weiny <ira.weiny@intel.com> > > > > > > CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox > > > command. After an event record is read it needs to be cleared from the > > > event log. > > > > > > Implement cxl_clear_event_record() to clear all record retrieved from > > > the device. > > > > > > Each record is cleared explicitly. A clear all bit is specified but > > > events could arrive between a get and any final clear all operation. > > > This means events would be missed. > > > Therefore each event is cleared specifically. > > > > Note that the spec has a better reason for why Clear All has limited > > usage: > > > > "Clear All Events is only allowed when the Event Log has overflowed; > > otherwise, the device shall return Invalid Input." > > > > Will need to wait and see if we need that to keep pace with a device > > with a high event frequency. > > Perhaps. But yea I would wait and see. > > [snip] > > > > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > > > + enum cxl_event_log_type log, > > > + struct cxl_get_event_payload *get_pl, > > > + u16 total) > > > +{ > > > + struct cxl_mbox_clear_event_payload payload = { > > > + .event_log = log, > > > + }; > > > + int cnt; > > > + > > > + /* > > > + * Clear Event Records uses u8 for the handle cnt while Get Event > > > + * Record can return up to 0xffff records. > > > + */ > > > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > > > + u8 nr_recs = min_t(u8, (total - cnt), > > > + CXL_CLEAR_EVENT_MAX_HANDLES); > > > > This seems overly complicated. @total is a duplicate of > > @get_pl->record_count, and the 2 loops feel like it could be cut > > down to one. > > Sure, total is redundant to pass to the function. > > However, 2 loops is IMO not at all overly complicated. Note that the 2 loops > do not do the same thing. The inner loop is filling in the payload for the > Clear command. There is really no way around doing this. > > Now that I've had time to think about it: > > Are you suggesting we issue a single mailbox command for every handle? > > That would be a single loop. But a lot more mailbox commands. I was thinking something like this pseudo code int tosend = le16_to_cpu(get_pl->record_count); int added = 0; for (i = 0; i < tosend; i++) { add_to_clear(added++); if (added == MAX) send_mailbox(); added = 0; } if (added) send_mailbox(); ...where it batches and sends every 256 and one more send afterwards for any stragglers.
On Fri, Dec 02, 2022 at 05:14:27PM -0800, Dan Williams wrote: > Ira Weiny wrote: > > On Thu, Dec 01, 2022 at 06:29:20PM -0800, Dan Williams wrote: > > > ira.weiny@ wrote: > > > > From: Ira Weiny <ira.weiny@intel.com> > > > > > > > > CXL rev 3.0 section 8.2.9.2.3 defines the Clear Event Records mailbox > > > > command. After an event record is read it needs to be cleared from the > > > > event log. > > > > > > > > Implement cxl_clear_event_record() to clear all record retrieved from > > > > the device. > > > > > > > > Each record is cleared explicitly. A clear all bit is specified but > > > > events could arrive between a get and any final clear all operation. > > > > This means events would be missed. > > > > Therefore each event is cleared specifically. > > > > > > Note that the spec has a better reason for why Clear All has limited > > > usage: > > > > > > "Clear All Events is only allowed when the Event Log has overflowed; > > > otherwise, the device shall return Invalid Input." > > > > > > Will need to wait and see if we need that to keep pace with a device > > > with a high event frequency. > > > > Perhaps. But yea I would wait and see. > > > > [snip] > > > > > > +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, > > > > + enum cxl_event_log_type log, > > > > + struct cxl_get_event_payload *get_pl, > > > > + u16 total) > > > > +{ > > > > + struct cxl_mbox_clear_event_payload payload = { > > > > + .event_log = log, > > > > + }; > > > > + int cnt; > > > > + > > > > + /* > > > > + * Clear Event Records uses u8 for the handle cnt while Get Event > > > > + * Record can return up to 0xffff records. > > > > + */ > > > > + for (cnt = 0; cnt < total; /* cnt incremented internally */) { > > > > + u8 nr_recs = min_t(u8, (total - cnt), > > > > + CXL_CLEAR_EVENT_MAX_HANDLES); > > > > > > This seems overly complicated. @total is a duplicate of > > > @get_pl->record_count, and the 2 loops feel like it could be cut > > > down to one. > > > > Sure, total is redundant to pass to the function. > > > > However, 2 loops is IMO not at all overly complicated. Note that the 2 loops > > do not do the same thing. The inner loop is filling in the payload for the > > Clear command. There is really no way around doing this. > > > > Now that I've had time to think about it: > > > > Are you suggesting we issue a single mailbox command for every handle? > > > > That would be a single loop. But a lot more mailbox commands. > > I was thinking something like this pseudo code > > int tosend = le16_to_cpu(get_pl->record_count); > int added = 0; > > for (i = 0; i < tosend; i++) { > add_to_clear(added++); > if (added == MAX) > send_mailbox(); > added = 0; > } > > if (added) > send_mailbox(); > > ...where it batches and sends every 256 and one more send afterwards for > any stragglers. Ok I'm not convinced it makes that much difference but I don't have the fortitude to try and look at the assembly to argue... ;-) Done. Ira
diff --git a/drivers/cxl/core/mbox.c b/drivers/cxl/core/mbox.c index 70b681027a3d..076a3df0ba38 100644 --- a/drivers/cxl/core/mbox.c +++ b/drivers/cxl/core/mbox.c @@ -52,6 +52,7 @@ static struct cxl_mem_command cxl_mem_commands[CXL_MEM_COMMAND_ID_MAX] = { #endif CXL_CMD(GET_SUPPORTED_LOGS, 0, CXL_VARIABLE_PAYLOAD, CXL_CMD_FLAG_FORCE_ENABLE), CXL_CMD(GET_EVENT_RECORD, 1, CXL_VARIABLE_PAYLOAD, 0), + CXL_CMD(CLEAR_EVENT_RECORD, CXL_VARIABLE_PAYLOAD, 0, 0), CXL_CMD(GET_FW_INFO, 0, 0x50, 0), CXL_CMD(GET_PARTITION_INFO, 0, 0x20, 0), CXL_CMD(GET_LSA, 0x8, CXL_VARIABLE_PAYLOAD, 0), @@ -708,6 +709,42 @@ int cxl_enumerate_cmds(struct cxl_dev_state *cxlds) } EXPORT_SYMBOL_NS_GPL(cxl_enumerate_cmds, CXL); +static int cxl_clear_event_record(struct cxl_dev_state *cxlds, + enum cxl_event_log_type log, + struct cxl_get_event_payload *get_pl, + u16 total) +{ + struct cxl_mbox_clear_event_payload payload = { + .event_log = log, + }; + int cnt; + + /* + * Clear Event Records uses u8 for the handle cnt while Get Event + * Record can return up to 0xffff records. + */ + for (cnt = 0; cnt < total; /* cnt incremented internally */) { + u8 nr_recs = min_t(u8, (total - cnt), + CXL_CLEAR_EVENT_MAX_HANDLES); + int i, rc; + + for (i = 0; i < nr_recs; i++, cnt++) { + payload.handle[i] = get_pl->records[cnt].hdr.handle; + dev_dbg(cxlds->dev, "Event log '%s': Clearning %u\n", + cxl_event_log_type_str(log), + le16_to_cpu(payload.handle[i])); + } + payload.nr_recs = nr_recs; + + rc = cxl_mbox_send_cmd(cxlds, CXL_MBOX_OP_CLEAR_EVENT_RECORD, + &payload, sizeof(payload), NULL, 0); + if (rc) + return rc; + } + + return 0; +} + static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, enum cxl_event_log_type type) { @@ -732,13 +769,22 @@ static void cxl_mem_get_records_log(struct cxl_dev_state *cxlds, } nr_rec = le16_to_cpu(payload->record_count); - if (trace_cxl_generic_event_enabled()) { + if (nr_rec > 0) { int i; - for (i = 0; i < nr_rec; i++) - trace_cxl_generic_event(dev_name(cxlds->dev), - type, - &payload->records[i]); + if (trace_cxl_generic_event_enabled()) { + for (i = 0; i < nr_rec; i++) + trace_cxl_generic_event(dev_name(cxlds->dev), + type, + &payload->records[i]); + } + + rc = cxl_clear_event_record(cxlds, type, payload, nr_rec); + if (rc) { + dev_err(cxlds->dev, "Event log '%s': Failed to clear events : %d", + cxl_event_log_type_str(type), rc); + return; + } } if (trace_cxl_overflow_enabled() && @@ -780,10 +826,11 @@ static struct cxl_get_event_payload *alloc_event_buf(struct cxl_dev_state *cxlds * cxl_mem_get_event_records - Get Event Records from the device * @cxlds: The device data for the operation * - * Retrieve all event records available on the device and report them as trace - * events. + * Retrieve all event records available on the device, report them as trace + * events, and clear them. * * See CXL rev 3.0 @8.2.9.2.2 Get Event Records + * See CXL rev 3.0 @8.2.9.2.3 Clear Event Records */ void cxl_mem_get_event_records(struct cxl_dev_state *cxlds) { diff --git a/drivers/cxl/cxlmem.h b/drivers/cxl/cxlmem.h index 55d57f5a64bc..1ae9962c5a06 100644 --- a/drivers/cxl/cxlmem.h +++ b/drivers/cxl/cxlmem.h @@ -261,6 +261,7 @@ enum cxl_opcode { CXL_MBOX_OP_INVALID = 0x0000, CXL_MBOX_OP_RAW = CXL_MBOX_OP_INVALID, CXL_MBOX_OP_GET_EVENT_RECORD = 0x0100, + CXL_MBOX_OP_CLEAR_EVENT_RECORD = 0x0101, CXL_MBOX_OP_GET_FW_INFO = 0x0200, CXL_MBOX_OP_ACTIVATE_FW = 0x0202, CXL_MBOX_OP_GET_SUPPORTED_LOGS = 0x0400, @@ -396,6 +397,19 @@ static inline const char *cxl_event_log_type_str(enum cxl_event_log_type type) return "<unknown>"; } +/* + * Clear Event Records input payload + * CXL rev 3.0 section 8.2.9.2.3; Table 8-51 + */ +#define CXL_CLEAR_EVENT_MAX_HANDLES (0xff) +struct cxl_mbox_clear_event_payload { + u8 event_log; /* enum cxl_event_log_type */ + u8 clear_flags; + u8 nr_recs; + u8 reserved[3]; + __le16 handle[CXL_CLEAR_EVENT_MAX_HANDLES]; +}; + struct cxl_mbox_get_partition_info { __le64 active_volatile_cap; __le64 active_persistent_cap; diff --git a/include/uapi/linux/cxl_mem.h b/include/uapi/linux/cxl_mem.h index 70459be5bdd4..7c1ad8062792 100644 --- a/include/uapi/linux/cxl_mem.h +++ b/include/uapi/linux/cxl_mem.h @@ -25,6 +25,7 @@ ___C(RAW, "Raw device command"), \ ___C(GET_SUPPORTED_LOGS, "Get Supported Logs"), \ ___C(GET_EVENT_RECORD, "Get Event Record"), \ + ___C(CLEAR_EVENT_RECORD, "Clear Event Record"), \ ___C(GET_FW_INFO, "Get FW Info"), \ ___C(GET_PARTITION_INFO, "Get Partition Information"), \ ___C(GET_LSA, "Get Label Storage Area"), \