Message ID | 20221229124728.66515-1-yangjihong1@huawei.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:4e01:0:0:0:0:0 with SMTP id p1csp2384471wrt; Thu, 29 Dec 2022 04:53:13 -0800 (PST) X-Google-Smtp-Source: AMrXdXsduTHGPF1kSk2xf8fWnPI0kmlKXTMOWKus68p5eq5ohV/1Qxqu3x8tnQEq6Aw0y9uiknU8 X-Received: by 2002:a17:902:8d93:b0:192:6990:ba60 with SMTP id v19-20020a1709028d9300b001926990ba60mr16424463plo.63.1672318393505; Thu, 29 Dec 2022 04:53:13 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1672318393; cv=none; d=google.com; s=arc-20160816; b=rZED2lrv8JeNLI9jzRtPLCrghy+DlSxCkS/If/qS+nCK43agt43JtSBqlCdpgcNb5y xM15cgMLyC/t0VJA2Hm1QNcc3T1+wUw5Dy3QZnHGhXj9IUt+qSloNjpi1IZE6ce1qlHN YE6gtlGwYbZx8qNJd/DhFsGxHQfjO/+2n9bmrq4LoP9u+afaOp/AhdcVMK17eMxJnvMb yAMS+uHiD7rXhVEkqMMr4fCI+S1hC5XsQKIHqEPl1JWsBlKvV6LneJUIYFxg18HOw7pC PwwKPUtsoeD2fnHF9p1KXEQhlSbRIPZPSW2uu2tP11ErKO8dDUHW5mDX2S6zB9WVUAda TPTA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=On0Fe7xfBVxDjSgT6Spkr6yrwx+dJrmOvtjZzipPrB0=; b=Dnvm5bJVtJObfOwupfRGJtUenJDx06s83o2/CN408dAwLVxyFPRl7VIJVZqXVSvye2 862PsuxLl42VQUDGU1z+unxPt6tsw5c7DuFJj9ETmU8ol2MNi73UZX2Lk0b8fD1BE4Mh v1c6GkptSeh891SJVuhI6YEEy1qq9ikOe4HJ4Vaq3DQK2D/j1J3IW2cfpbKoZOfR7Sp/ jeEURodl2ozzsdJzxFimvlFkkmL5AemovzNr2cphgMCpkVSQLG2Bc4yQixmGymOp+h3E SprnyFEtgoyzEe094PYy2DGn2ZQRR62xNiOFEAzLfp9OnSw6jPlDP2Tn/dNgRH/H3R9q 7UPQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id k14-20020a170902d58e00b00187480df4b3si14074734plh.277.2022.12.29.04.53.01; Thu, 29 Dec 2022 04:53:13 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233445AbiL2MwV (ORCPT <rfc822;eddaouddi.ayoub@gmail.com> + 99 others); Thu, 29 Dec 2022 07:52:21 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40972 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233539AbiL2Mva (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Thu, 29 Dec 2022 07:51:30 -0500 Received: from szxga08-in.huawei.com (szxga08-in.huawei.com [45.249.212.255]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4C8A01408D; Thu, 29 Dec 2022 04:51:08 -0800 (PST) Received: from kwepemm600003.china.huawei.com (unknown [172.30.72.57]) by szxga08-in.huawei.com (SkyGuard) with ESMTP id 4NjSs80mMqz16Lvr; Thu, 29 Dec 2022 20:49:48 +0800 (CST) Received: from localhost.localdomain (10.67.174.95) by kwepemm600003.china.huawei.com (7.193.23.202) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.34; Thu, 29 Dec 2022 20:51:05 +0800 From: Yang Jihong <yangjihong1@huawei.com> To: <peterz@infradead.org>, <mingo@redhat.com>, <acme@kernel.org>, <mark.rutland@arm.com>, <alexander.shishkin@linux.intel.com>, <jolsa@kernel.org>, <namhyung@kernel.org>, <jiwei.sun@windriver.com>, <linux-perf-users@vger.kernel.org>, <linux-kernel@vger.kernel.org> CC: <yangjihong1@huawei.com> Subject: [PATCH v2] perf record: Fix coredump with --overwrite and --max-size Date: Thu, 29 Dec 2022 12:47:28 +0000 Message-ID: <20221229124728.66515-1-yangjihong1@huawei.com> X-Mailer: git-send-email 2.30.GIT MIME-Version: 1.0 Content-Transfer-Encoding: 7BIT Content-Type: text/plain; charset=US-ASCII X-Originating-IP: [10.67.174.95] X-ClientProxiedBy: dggems701-chm.china.huawei.com (10.3.19.178) To kwepemm600003.china.huawei.com (7.193.23.202) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1753373881971073210?= X-GMAIL-MSGID: =?utf-8?q?1753552932146759951?= |
Series |
[v2] perf record: Fix coredump with --overwrite and --max-size
|
|
Commit Message
Yang Jihong
Dec. 29, 2022, 12:47 p.m. UTC
When --overwrite and --max-size options of perf record are used together,
a segmentation fault occurs. The following is an example:
# perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1
[ perf record: Woken up 1 times to write data ]
perf: Segmentation fault
Obtained 1 stack frames.
[0xc4c67f]
Segmentation fault (core dumped)
backtrace of the core file is as follows:
#0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630
#1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950
#2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936
#3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947
#4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010
#5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810
#6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837
#7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938
#8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241
#9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117
#10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56
#11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410
#12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431
#13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562
The reason is that record__bytes_written accesses the freed memory rec->thread_data,
The process is as follows:
__cmd_record
-> record__free_thread_data
-> zfree(&rec->thread_data) // free rec->thread_data
-> record__synthesize
-> perf_event__synthesize_id_index
-> process_synthesized_event
-> record__write
-> record__bytes_written // access rec->thread_data
we only need to check the value of done first.
Also add variable check in record__bytes_written for code hardening,
and save bytes_written separately to reduce one calculation.
Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size")
Signed-off-by: Yang Jihong <yangjihong1@huawei.com>
---
Changes since v1:
- Add variable check in record__bytes_written for code hardening.
- Save bytes_written separately to reduce one calculation.
- Remove rec->opts.tail_synthesize check.
tools/perf/builtin-record.c | 26 +++++++++++++++++---------
1 file changed, 17 insertions(+), 9 deletions(-)
Comments
Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: > When --overwrite and --max-size options of perf record are used together, > a segmentation fault occurs. The following is an example: > > # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 > [ perf record: Woken up 1 times to write data ] > perf: Segmentation fault > Obtained 1 stack frames. > [0xc4c67f] > Segmentation fault (core dumped) > > backtrace of the core file is as follows: > > #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 > #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 > #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 > #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 > #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 > #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 > #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 > #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 > #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 > #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 > #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 > #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 > #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 > #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 > > The reason is that record__bytes_written accesses the freed memory rec->thread_data, > The process is as follows: > __cmd_record > -> record__free_thread_data > -> zfree(&rec->thread_data) // free rec->thread_data > -> record__synthesize > -> perf_event__synthesize_id_index > -> process_synthesized_event > -> record__write > -> record__bytes_written // access rec->thread_data > > we only need to check the value of done first. > Also add variable check in record__bytes_written for code hardening, > and save bytes_written separately to reduce one calculation. > > Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") > Signed-off-by: Yang Jihong <yangjihong1@huawei.com> > --- > > Changes since v1: > - Add variable check in record__bytes_written for code hardening. > - Save bytes_written separately to reduce one calculation. > - Remove rec->opts.tail_synthesize check. Namhyung, are you ok with this now? - Arnaldo > tools/perf/builtin-record.c | 26 +++++++++++++++++--------- > 1 file changed, 17 insertions(+), 9 deletions(-) > > diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > index 29dcd454b8e2..acba9e43e519 100644 > --- a/tools/perf/builtin-record.c > +++ b/tools/perf/builtin-record.c > @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) > u64 bytes_written = rec->bytes_written; > struct record_thread *thread_data = rec->thread_data; > > + if (thread_data == NULL) > + return bytes_written; > + > for (t = 0; t < rec->nr_threads; t++) > bytes_written += thread_data[t].bytes_written; > > return bytes_written; > } > > -static bool record__output_max_size_exceeded(struct record *rec) > +static void record__check_output_max_size_exceeded(struct record *rec) > { > - return rec->output_max_size && > - (record__bytes_written(rec) >= rec->output_max_size); > + u64 bytes_written; > + > + if (rec->output_max_size == 0 || done) > + return; > + > + bytes_written = record__bytes_written(rec); > + if (bytes_written >= rec->output_max_size) { > + fprintf(stderr, "[ perf record: perf size limit reached (%" PRIu64 " KB)," > + " stopping session ]\n", bytes_written >> 10); > + > + done = 1; > + } > } > > static int record__write(struct record *rec, struct mmap *map __maybe_unused, > @@ -260,12 +273,7 @@ static int record__write(struct record *rec, struct mmap *map __maybe_unused, > else > rec->bytes_written += size; > > - if (record__output_max_size_exceeded(rec) && !done) { > - fprintf(stderr, "[ perf record: perf size limit reached (%" PRIu64 " KB)," > - " stopping session ]\n", > - record__bytes_written(rec) >> 10); > - done = 1; > - } > + record__check_output_max_size_exceeded(rec); > > if (switch_output_size(rec)) > trigger_hit(&switch_output_trigger); > -- > 2.30.GIT
On Mon, Jan 2, 2023 at 8:20 AM Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > > Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: > > When --overwrite and --max-size options of perf record are used together, > > a segmentation fault occurs. The following is an example: > > > > # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 > > [ perf record: Woken up 1 times to write data ] > > perf: Segmentation fault > > Obtained 1 stack frames. > > [0xc4c67f] > > Segmentation fault (core dumped) > > > > backtrace of the core file is as follows: > > > > #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 > > #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 > > #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 > > #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 > > #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 > > #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 > > #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 > > #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 > > #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 > > #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 > > #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 > > #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 > > #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 > > #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 > > > > The reason is that record__bytes_written accesses the freed memory rec->thread_data, > > The process is as follows: > > __cmd_record > > -> record__free_thread_data > > -> zfree(&rec->thread_data) // free rec->thread_data > > -> record__synthesize > > -> perf_event__synthesize_id_index > > -> process_synthesized_event > > -> record__write > > -> record__bytes_written // access rec->thread_data > > > > we only need to check the value of done first. > > Also add variable check in record__bytes_written for code hardening, > > and save bytes_written separately to reduce one calculation. > > > > Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") > > Signed-off-by: Yang Jihong <yangjihong1@huawei.com> > > --- > > > > Changes since v1: > > - Add variable check in record__bytes_written for code hardening. > > - Save bytes_written separately to reduce one calculation. > > - Remove rec->opts.tail_synthesize check. > > Namhyung, are you ok with this now? > > - Arnaldo > > > tools/perf/builtin-record.c | 26 +++++++++++++++++--------- > > 1 file changed, 17 insertions(+), 9 deletions(-) > > > > diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > > index 29dcd454b8e2..acba9e43e519 100644 > > --- a/tools/perf/builtin-record.c > > +++ b/tools/perf/builtin-record.c > > @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) > > u64 bytes_written = rec->bytes_written; > > struct record_thread *thread_data = rec->thread_data; > > > > + if (thread_data == NULL) > > + return bytes_written; > > + Then it won't count bytes written by threads, right? I think it needs to be saved somewhere. Thanks, Namhyung > > for (t = 0; t < rec->nr_threads; t++) > > bytes_written += thread_data[t].bytes_written; > > > > return bytes_written; > > } > > > > -static bool record__output_max_size_exceeded(struct record *rec) > > +static void record__check_output_max_size_exceeded(struct record *rec) > > { > > - return rec->output_max_size && > > - (record__bytes_written(rec) >= rec->output_max_size); > > + u64 bytes_written; > > + > > + if (rec->output_max_size == 0 || done) > > + return; > > + > > + bytes_written = record__bytes_written(rec); > > + if (bytes_written >= rec->output_max_size) { > > + fprintf(stderr, "[ perf record: perf size limit reached (%" PRIu64 " KB)," > > + " stopping session ]\n", bytes_written >> 10); > > + > > + done = 1; > > + } > > } > > > > static int record__write(struct record *rec, struct mmap *map __maybe_unused, > > @@ -260,12 +273,7 @@ static int record__write(struct record *rec, struct mmap *map __maybe_unused, > > else > > rec->bytes_written += size; > > > > - if (record__output_max_size_exceeded(rec) && !done) { > > - fprintf(stderr, "[ perf record: perf size limit reached (%" PRIu64 " KB)," > > - " stopping session ]\n", > > - record__bytes_written(rec) >> 10); > > - done = 1; > > - } > > + record__check_output_max_size_exceeded(rec); > > > > if (switch_output_size(rec)) > > trigger_hit(&switch_output_trigger); > > -- > > 2.30.GIT > > -- > > - Arnaldo
Hello, On 2023/1/4 0:50, Namhyung Kim wrote: > On Mon, Jan 2, 2023 at 8:20 AM Arnaldo Carvalho de Melo <acme@kernel.org> wrote: >> >> Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: >>> When --overwrite and --max-size options of perf record are used together, >>> a segmentation fault occurs. The following is an example: >>> >>> # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 >>> [ perf record: Woken up 1 times to write data ] >>> perf: Segmentation fault >>> Obtained 1 stack frames. >>> [0xc4c67f] >>> Segmentation fault (core dumped) >>> >>> backtrace of the core file is as follows: >>> >>> #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 >>> #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 >>> #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 >>> #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 >>> #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 >>> #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 >>> #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 >>> #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 >>> #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 >>> #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 >>> #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 >>> #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 >>> #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 >>> #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 >>> >>> The reason is that record__bytes_written accesses the freed memory rec->thread_data, >>> The process is as follows: >>> __cmd_record >>> -> record__free_thread_data >>> -> zfree(&rec->thread_data) // free rec->thread_data >>> -> record__synthesize >>> -> perf_event__synthesize_id_index >>> -> process_synthesized_event >>> -> record__write >>> -> record__bytes_written // access rec->thread_data >>> >>> we only need to check the value of done first. >>> Also add variable check in record__bytes_written for code hardening, >>> and save bytes_written separately to reduce one calculation. >>> >>> Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") >>> Signed-off-by: Yang Jihong <yangjihong1@huawei.com> >>> --- >>> >>> Changes since v1: >>> - Add variable check in record__bytes_written for code hardening. >>> - Save bytes_written separately to reduce one calculation. >>> - Remove rec->opts.tail_synthesize check. >> >> Namhyung, are you ok with this now? >> >> - Arnaldo >> >>> tools/perf/builtin-record.c | 26 +++++++++++++++++--------- >>> 1 file changed, 17 insertions(+), 9 deletions(-) >>> >>> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c >>> index 29dcd454b8e2..acba9e43e519 100644 >>> --- a/tools/perf/builtin-record.c >>> +++ b/tools/perf/builtin-record.c >>> @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) >>> u64 bytes_written = rec->bytes_written; >>> struct record_thread *thread_data = rec->thread_data; >>> >>> + if (thread_data == NULL) >>> + return bytes_written; >>> + > > Then it won't count bytes written by threads, right? > I think it needs to be saved somewhere. > I'm not sure here. Can you explain it more clearly, thanks :) I can modify it accordingly. I think if thread_data == NULL, it is not thread data. In this case, we just return rec->bytes_written. Thanks, Yang
Hello, On Wed, Jan 4, 2023 at 8:09 PM Yang Jihong <yangjihong1@huawei.com> wrote: > > Hello, > > On 2023/1/4 0:50, Namhyung Kim wrote: > > On Mon, Jan 2, 2023 at 8:20 AM Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > >> > >> Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: > >>> When --overwrite and --max-size options of perf record are used together, > >>> a segmentation fault occurs. The following is an example: > >>> > >>> # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 > >>> [ perf record: Woken up 1 times to write data ] > >>> perf: Segmentation fault > >>> Obtained 1 stack frames. > >>> [0xc4c67f] > >>> Segmentation fault (core dumped) > >>> > >>> backtrace of the core file is as follows: > >>> > >>> #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 > >>> #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 > >>> #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 > >>> #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 > >>> #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 > >>> #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 > >>> #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 > >>> #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 > >>> #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 > >>> #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 > >>> #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 > >>> #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 > >>> #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 > >>> #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 > >>> > >>> The reason is that record__bytes_written accesses the freed memory rec->thread_data, > >>> The process is as follows: > >>> __cmd_record > >>> -> record__free_thread_data > >>> -> zfree(&rec->thread_data) // free rec->thread_data > >>> -> record__synthesize > >>> -> perf_event__synthesize_id_index > >>> -> process_synthesized_event > >>> -> record__write > >>> -> record__bytes_written // access rec->thread_data > >>> > >>> we only need to check the value of done first. > >>> Also add variable check in record__bytes_written for code hardening, > >>> and save bytes_written separately to reduce one calculation. > >>> > >>> Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") > >>> Signed-off-by: Yang Jihong <yangjihong1@huawei.com> > >>> --- > >>> > >>> Changes since v1: > >>> - Add variable check in record__bytes_written for code hardening. > >>> - Save bytes_written separately to reduce one calculation. > >>> - Remove rec->opts.tail_synthesize check. > >> > >> Namhyung, are you ok with this now? > >> > >> - Arnaldo > >> > >>> tools/perf/builtin-record.c | 26 +++++++++++++++++--------- > >>> 1 file changed, 17 insertions(+), 9 deletions(-) > >>> > >>> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > >>> index 29dcd454b8e2..acba9e43e519 100644 > >>> --- a/tools/perf/builtin-record.c > >>> +++ b/tools/perf/builtin-record.c > >>> @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) > >>> u64 bytes_written = rec->bytes_written; > >>> struct record_thread *thread_data = rec->thread_data; > >>> > >>> + if (thread_data == NULL) > >>> + return bytes_written; > >>> + > > > > Then it won't count bytes written by threads, right? > > I think it needs to be saved somewhere. > > > I'm not sure here. Can you explain it more clearly, thanks :) > I can modify it accordingly. > > I think if thread_data == NULL, it is not thread data. > In this case, we just return rec->bytes_written. It can be thread data but freed before tail synthesis, right? In that case, I think it needs to add bytes_written by threads to calculate the correct data size. Thanks, Namhyung
Hello, On 2023/1/7 5:12, Namhyung Kim wrote: > Hello, > > On Wed, Jan 4, 2023 at 8:09 PM Yang Jihong <yangjihong1@huawei.com> wrote: >> >> Hello, >> >> On 2023/1/4 0:50, Namhyung Kim wrote: >>> On Mon, Jan 2, 2023 at 8:20 AM Arnaldo Carvalho de Melo <acme@kernel.org> wrote: >>>> >>>> Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: >>>>> When --overwrite and --max-size options of perf record are used together, >>>>> a segmentation fault occurs. The following is an example: >>>>> >>>>> # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 >>>>> [ perf record: Woken up 1 times to write data ] >>>>> perf: Segmentation fault >>>>> Obtained 1 stack frames. >>>>> [0xc4c67f] >>>>> Segmentation fault (core dumped) >>>>> >>>>> backtrace of the core file is as follows: >>>>> >>>>> #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 >>>>> #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 >>>>> #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 >>>>> #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 >>>>> #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 >>>>> #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 >>>>> #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 >>>>> #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 >>>>> #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 >>>>> #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 >>>>> #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 >>>>> #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 >>>>> #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 >>>>> #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 >>>>> >>>>> The reason is that record__bytes_written accesses the freed memory rec->thread_data, >>>>> The process is as follows: >>>>> __cmd_record >>>>> -> record__free_thread_data >>>>> -> zfree(&rec->thread_data) // free rec->thread_data >>>>> -> record__synthesize >>>>> -> perf_event__synthesize_id_index >>>>> -> process_synthesized_event >>>>> -> record__write >>>>> -> record__bytes_written // access rec->thread_data >>>>> >>>>> we only need to check the value of done first. >>>>> Also add variable check in record__bytes_written for code hardening, >>>>> and save bytes_written separately to reduce one calculation. >>>>> >>>>> Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") >>>>> Signed-off-by: Yang Jihong <yangjihong1@huawei.com> >>>>> --- >>>>> >>>>> Changes since v1: >>>>> - Add variable check in record__bytes_written for code hardening. >>>>> - Save bytes_written separately to reduce one calculation. >>>>> - Remove rec->opts.tail_synthesize check. >>>> >>>> Namhyung, are you ok with this now? >>>> >>>> - Arnaldo >>>> >>>>> tools/perf/builtin-record.c | 26 +++++++++++++++++--------- >>>>> 1 file changed, 17 insertions(+), 9 deletions(-) >>>>> >>>>> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c >>>>> index 29dcd454b8e2..acba9e43e519 100644 >>>>> --- a/tools/perf/builtin-record.c >>>>> +++ b/tools/perf/builtin-record.c >>>>> @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) >>>>> u64 bytes_written = rec->bytes_written; >>>>> struct record_thread *thread_data = rec->thread_data; >>>>> >>>>> + if (thread_data == NULL) >>>>> + return bytes_written; >>>>> + >>> >>> Then it won't count bytes written by threads, right? >>> I think it needs to be saved somewhere. >>> >> I'm not sure here. Can you explain it more clearly, thanks :) >> I can modify it accordingly. >> >> I think if thread_data == NULL, it is not thread data. >> In this case, we just return rec->bytes_written. > > It can be thread data but freed before tail synthesis, right? > In that case, I think it needs to add bytes_written by threads > to calculate the correct data size. Em... In the __cmd_record function, record__stop_threads is called before record__free_thread_data, so if the thread has been freed, there will be no thread data. I think it's okay to ignore the situation you mentioned above. Thanks, Yang
On Sun, Jan 8, 2023 at 6:47 PM Yang Jihong <yangjihong1@huawei.com> wrote: > > Hello, > > On 2023/1/7 5:12, Namhyung Kim wrote: > > Hello, > > > > On Wed, Jan 4, 2023 at 8:09 PM Yang Jihong <yangjihong1@huawei.com> wrote: > >> > >> Hello, > >> > >> On 2023/1/4 0:50, Namhyung Kim wrote: > >>> On Mon, Jan 2, 2023 at 8:20 AM Arnaldo Carvalho de Melo <acme@kernel.org> wrote: > >>>> > >>>> Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: > >>>>> When --overwrite and --max-size options of perf record are used together, > >>>>> a segmentation fault occurs. The following is an example: > >>>>> > >>>>> # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 > >>>>> [ perf record: Woken up 1 times to write data ] > >>>>> perf: Segmentation fault > >>>>> Obtained 1 stack frames. > >>>>> [0xc4c67f] > >>>>> Segmentation fault (core dumped) > >>>>> > >>>>> backtrace of the core file is as follows: > >>>>> > >>>>> #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 > >>>>> #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 > >>>>> #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 > >>>>> #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 > >>>>> #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 > >>>>> #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 > >>>>> #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 > >>>>> #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 > >>>>> #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 > >>>>> #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 > >>>>> #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 > >>>>> #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 > >>>>> #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 > >>>>> #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 > >>>>> > >>>>> The reason is that record__bytes_written accesses the freed memory rec->thread_data, > >>>>> The process is as follows: > >>>>> __cmd_record > >>>>> -> record__free_thread_data > >>>>> -> zfree(&rec->thread_data) // free rec->thread_data > >>>>> -> record__synthesize > >>>>> -> perf_event__synthesize_id_index > >>>>> -> process_synthesized_event > >>>>> -> record__write > >>>>> -> record__bytes_written // access rec->thread_data > >>>>> > >>>>> we only need to check the value of done first. > >>>>> Also add variable check in record__bytes_written for code hardening, > >>>>> and save bytes_written separately to reduce one calculation. > >>>>> > >>>>> Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") > >>>>> Signed-off-by: Yang Jihong <yangjihong1@huawei.com> > >>>>> --- > >>>>> > >>>>> Changes since v1: > >>>>> - Add variable check in record__bytes_written for code hardening. > >>>>> - Save bytes_written separately to reduce one calculation. > >>>>> - Remove rec->opts.tail_synthesize check. > >>>> > >>>> Namhyung, are you ok with this now? > >>>> > >>>> - Arnaldo > >>>> > >>>>> tools/perf/builtin-record.c | 26 +++++++++++++++++--------- > >>>>> 1 file changed, 17 insertions(+), 9 deletions(-) > >>>>> > >>>>> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c > >>>>> index 29dcd454b8e2..acba9e43e519 100644 > >>>>> --- a/tools/perf/builtin-record.c > >>>>> +++ b/tools/perf/builtin-record.c > >>>>> @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) > >>>>> u64 bytes_written = rec->bytes_written; > >>>>> struct record_thread *thread_data = rec->thread_data; > >>>>> > >>>>> + if (thread_data == NULL) > >>>>> + return bytes_written; > >>>>> + > >>> > >>> Then it won't count bytes written by threads, right? > >>> I think it needs to be saved somewhere. > >>> > >> I'm not sure here. Can you explain it more clearly, thanks :) > >> I can modify it accordingly. > >> > >> I think if thread_data == NULL, it is not thread data. > >> In this case, we just return rec->bytes_written. > > > > It can be thread data but freed before tail synthesis, right? > > In that case, I think it needs to add bytes_written by threads > > to calculate the correct data size. > Em... In the __cmd_record function, record__stop_threads is called > before record__free_thread_data, so if the thread has been freed, there > will be no thread data. > I think it's okay to ignore the situation you mentioned above. Right, the thread data is already freed, but we need the size. I think it didn't (and won't) update to rec->bytes_written for the data written by the threads (data.X file) because it's only for the main 'data' file. So record__bytes_written() will return a smaller number after the threads are gone. But I think it should return the total data size. Thanks, Namhyung
Hello, On 2023/1/11 3:21, Namhyung Kim wrote: > On Sun, Jan 8, 2023 at 6:47 PM Yang Jihong <yangjihong1@huawei.com> wrote: >> >> Hello, >> >> On 2023/1/7 5:12, Namhyung Kim wrote: >>> Hello, >>> >>> On Wed, Jan 4, 2023 at 8:09 PM Yang Jihong <yangjihong1@huawei.com> wrote: >>>> >>>> Hello, >>>> >>>> On 2023/1/4 0:50, Namhyung Kim wrote: >>>>> On Mon, Jan 2, 2023 at 8:20 AM Arnaldo Carvalho de Melo <acme@kernel.org> wrote: >>>>>> >>>>>> Em Thu, Dec 29, 2022 at 12:47:28PM +0000, Yang Jihong escreveu: >>>>>>> When --overwrite and --max-size options of perf record are used together, >>>>>>> a segmentation fault occurs. The following is an example: >>>>>>> >>>>>>> # perf record -e sched:sched* --overwrite --max-size 1M -a -- sleep 1 >>>>>>> [ perf record: Woken up 1 times to write data ] >>>>>>> perf: Segmentation fault >>>>>>> Obtained 1 stack frames. >>>>>>> [0xc4c67f] >>>>>>> Segmentation fault (core dumped) >>>>>>> >>>>>>> backtrace of the core file is as follows: >>>>>>> >>>>>>> #0 0x0000000000417990 in process_locked_synthesized_event (tool=0x0, event=0x15, sample=0x1de0, machine=0xf8) at builtin-record.c:630 >>>>>>> #1 0x000000000057ee53 in perf_event__synthesize_threads (nr_threads_synthesize=21, mmap_data=<optimized out>, needs_mmap=<optimized out>, machine=0x17ad9b0, process=<optimized out>, tool=0x0) at util/synthetic-events.c:1950 >>>>>>> #2 __machine__synthesize_threads (nr_threads_synthesize=0, data_mmap=<optimized out>, needs_mmap=<optimized out>, process=<optimized out>, threads=0x8, target=0x8, tool=0x0, machine=0x17ad9b0) at util/synthetic-events.c:1936 >>>>>>> #3 machine__synthesize_threads (machine=0x17ad9b0, target=0x8, threads=0x8, needs_mmap=<optimized out>, data_mmap=<optimized out>, nr_threads_synthesize=0) at util/synthetic-events.c:1947 >>>>>>> #4 0x000000000040165d in record__synthesize (tail=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2010 >>>>>>> #5 0x0000000000403989 in __cmd_record (argc=<optimized out>, argv=<optimized out>, rec=0xbe2520 <record>) at builtin-record.c:2810 >>>>>>> #6 0x00000000004196ba in record__init_thread_user_masks (rec=0xbe2520 <record>, cpus=0x17a65f0) at builtin-record.c:3837 >>>>>>> #7 record__init_thread_masks (rec=0xbe2520 <record>) at builtin-record.c:3938 >>>>>>> #8 cmd_record (argc=1, argv=0x7ffdd692dc60) at builtin-record.c:4241 >>>>>>> #9 0x00000000004b701d in pager_command_config (var=0x0, value=0x15 <error: Cannot access memory at address 0x15>, data=0x1de0) at perf.c:117 >>>>>>> #10 0x00000000004b732b in get_leaf_frame_caller_aarch64 (sample=0xfffffffb, thread=0x0, usr_idx=<optimized out>) at util/arm64-frame-pointer-unwind-support.c:56 >>>>>>> #11 0x0000000000406331 in execv_dashed_external (argv=0x7ffdd692d9e8) at perf.c:410 >>>>>>> #12 run_argv (argcp=<synthetic pointer>, argv=<synthetic pointer>) at perf.c:431 >>>>>>> #13 main (argc=<optimized out>, argv=0x7ffdd692d9e8) at perf.c:562 >>>>>>> >>>>>>> The reason is that record__bytes_written accesses the freed memory rec->thread_data, >>>>>>> The process is as follows: >>>>>>> __cmd_record >>>>>>> -> record__free_thread_data >>>>>>> -> zfree(&rec->thread_data) // free rec->thread_data >>>>>>> -> record__synthesize >>>>>>> -> perf_event__synthesize_id_index >>>>>>> -> process_synthesized_event >>>>>>> -> record__write >>>>>>> -> record__bytes_written // access rec->thread_data >>>>>>> >>>>>>> we only need to check the value of done first. >>>>>>> Also add variable check in record__bytes_written for code hardening, >>>>>>> and save bytes_written separately to reduce one calculation. >>>>>>> >>>>>>> Fixes: 6d57581659f7 ("perf record: Add support for limit perf output file size") >>>>>>> Signed-off-by: Yang Jihong <yangjihong1@huawei.com> >>>>>>> --- >>>>>>> >>>>>>> Changes since v1: >>>>>>> - Add variable check in record__bytes_written for code hardening. >>>>>>> - Save bytes_written separately to reduce one calculation. >>>>>>> - Remove rec->opts.tail_synthesize check. >>>>>> >>>>>> Namhyung, are you ok with this now? >>>>>> >>>>>> - Arnaldo >>>>>> >>>>>>> tools/perf/builtin-record.c | 26 +++++++++++++++++--------- >>>>>>> 1 file changed, 17 insertions(+), 9 deletions(-) >>>>>>> >>>>>>> diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c >>>>>>> index 29dcd454b8e2..acba9e43e519 100644 >>>>>>> --- a/tools/perf/builtin-record.c >>>>>>> +++ b/tools/perf/builtin-record.c >>>>>>> @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) >>>>>>> u64 bytes_written = rec->bytes_written; >>>>>>> struct record_thread *thread_data = rec->thread_data; >>>>>>> >>>>>>> + if (thread_data == NULL) >>>>>>> + return bytes_written; >>>>>>> + >>>>> >>>>> Then it won't count bytes written by threads, right? >>>>> I think it needs to be saved somewhere. >>>>> >>>> I'm not sure here. Can you explain it more clearly, thanks :) >>>> I can modify it accordingly. >>>> >>>> I think if thread_data == NULL, it is not thread data. >>>> In this case, we just return rec->bytes_written. >>> >>> It can be thread data but freed before tail synthesis, right? >>> In that case, I think it needs to add bytes_written by threads >>> to calculate the correct data size. >> Em... In the __cmd_record function, record__stop_threads is called >> before record__free_thread_data, so if the thread has been freed, there >> will be no thread data. >> I think it's okay to ignore the situation you mentioned above. > > Right, the thread data is already freed, but we need the size. > > I think it didn't (and won't) update to rec->bytes_written for the data > written by the threads (data.X file) because it's only for the main > 'data' file. So record__bytes_written() will return a smaller number > after the threads are gone. But I think it should return the total > data size. > Yes, the total data size including data.X file should be returned here to fit the semantics, so there's a problem here, too. will fix in next version. Thanks, Yang
diff --git a/tools/perf/builtin-record.c b/tools/perf/builtin-record.c index 29dcd454b8e2..acba9e43e519 100644 --- a/tools/perf/builtin-record.c +++ b/tools/perf/builtin-record.c @@ -230,16 +230,29 @@ static u64 record__bytes_written(struct record *rec) u64 bytes_written = rec->bytes_written; struct record_thread *thread_data = rec->thread_data; + if (thread_data == NULL) + return bytes_written; + for (t = 0; t < rec->nr_threads; t++) bytes_written += thread_data[t].bytes_written; return bytes_written; } -static bool record__output_max_size_exceeded(struct record *rec) +static void record__check_output_max_size_exceeded(struct record *rec) { - return rec->output_max_size && - (record__bytes_written(rec) >= rec->output_max_size); + u64 bytes_written; + + if (rec->output_max_size == 0 || done) + return; + + bytes_written = record__bytes_written(rec); + if (bytes_written >= rec->output_max_size) { + fprintf(stderr, "[ perf record: perf size limit reached (%" PRIu64 " KB)," + " stopping session ]\n", bytes_written >> 10); + + done = 1; + } } static int record__write(struct record *rec, struct mmap *map __maybe_unused, @@ -260,12 +273,7 @@ static int record__write(struct record *rec, struct mmap *map __maybe_unused, else rec->bytes_written += size; - if (record__output_max_size_exceeded(rec) && !done) { - fprintf(stderr, "[ perf record: perf size limit reached (%" PRIu64 " KB)," - " stopping session ]\n", - record__bytes_written(rec) >> 10); - done = 1; - } + record__check_output_max_size_exceeded(rec); if (switch_output_size(rec)) trigger_hit(&switch_output_trigger);