Message ID | 20231025093847.3740104-4-zengheng4@huawei.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:ce89:0:b0:403:3b70:6f57 with SMTP id p9csp2475071vqx; Wed, 25 Oct 2023 02:34:09 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFv6fhvL/yRbYEKVHrALHZFQu5vjm52XF88cR+/lMkedWwwFjAFzh/G/KyEBc6k1nhnap52 X-Received: by 2002:a25:d013:0:b0:d9b:ff50:b100 with SMTP id h19-20020a25d013000000b00d9bff50b100mr14659361ybg.28.1698226449572; Wed, 25 Oct 2023 02:34:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698226449; cv=none; d=google.com; s=arc-20160816; b=XRuTRjJSMjCHAFX5tW44DDBXD8bDnFx5TkG1Md27tkd19347KJM5SY4Ymaxr2iflof UpK+JWtCHBFL4ZR18NMC/MKgu/A5zA2iKsNg7YnftAM/IZ8Qp8Tlntzfg2LqyE4yrhLr 7T29QFz5EaWEncLDr5BrUltEWGfbltAl3BqqueWerHzR9Kte1psP8WznNBn4xNLhRz3o zri1BzyxDkYeMzKa5zuRo2GH8dLXuw2kVPCsT8OGQARGrVyRHx1zkenQA7aTyhKiSNHH Ha78GVhnks2Ubl4HSB0SRrEnpGTvfL85OZp9L0T/KMagkHHYqVWAhji0IA3g5HSVqpZz ZZNQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=KdJmZIran55B3JvI/C5pXY1BQOJeF2oLnxVnMVw5Ip0=; fh=4OYJ3K2D0ofBlGhh3O4YuBicy23CNIUxbo78mZx4uJc=; b=tYtop0QKghSE1SwaoY0/+5pRDaQjHe2kHUvXbqHbJXno/p4AN8HOfBS3wnSJBG4B/l xQzTW0vaFQEpO41v2ZDCrjvF9/Qbr9WZYxRDVMhduli4TN1jVBZ3DclZcNMFSD61xbzR SYohK29/UbnBViawaN9N/r1z+FjyyLDJcM2/9Kb5NARE0GNoxrJCCNK3J7ILlQKJB4DT rh0+pGfmKNtC7tf2pBED0hJtZ4aIxjYKYwe0P7CpMIv6Yjh0mD8/exhUPlz3Kil4yox4 pfLEiDrV/E3UzG99FK6YKXPqopz302qFxum2velko2hu9qdTa0TcajNvP4bfXwWzZhtT nbTw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: from lipwig.vger.email (lipwig.vger.email. [23.128.96.33]) by mx.google.com with ESMTPS id x14-20020a056902102e00b00d9940128dd1si3583925ybt.738.2023.10.25.02.34.09 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 25 Oct 2023 02:34:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) client-ip=23.128.96.33; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id D02DC8023ED2; Wed, 25 Oct 2023 02:34:06 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234486AbjJYJdv (ORCPT <rfc822;aposhian.dev@gmail.com> + 26 others); Wed, 25 Oct 2023 05:33:51 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39480 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234397AbjJYJdq (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Wed, 25 Oct 2023 05:33:46 -0400 Received: from szxga08-in.huawei.com (szxga08-in.huawei.com [45.249.212.255]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0D4EBDC; Wed, 25 Oct 2023 02:33:44 -0700 (PDT) Received: from kwepemi500024.china.huawei.com (unknown [172.30.72.57]) by szxga08-in.huawei.com (SkyGuard) with ESMTP id 4SFkF60pgzz15NlK; Wed, 25 Oct 2023 17:30:50 +0800 (CST) Received: from huawei.com (10.175.103.91) by kwepemi500024.china.huawei.com (7.221.188.100) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.31; Wed, 25 Oct 2023 17:33:40 +0800 From: Zeng Heng <zengheng4@huawei.com> To: <broonie@kernel.org>, <joey.gouly@arm.com>, <will@kernel.org>, <amit.kachhap@arm.com>, <rafael@kernel.org>, <catalin.marinas@arm.com>, <james.morse@arm.com>, <mark.rutland@arm.com>, <maz@kernel.org>, <viresh.kumar@linaro.org>, <sumitg@nvidia.com>, <yang@os.amperecomputing.com> CC: <linux-kernel@vger.kernel.org>, <linux-pm@vger.kernel.org>, <linux-arm-kernel@lists.infradead.org>, <wangxiongfeng2@huawei.com>, <xiexiuqi@huawei.com> Subject: [PATCH 3/3] cpufreq: CPPC: Eliminate the impact of cpc_read() latency error Date: Wed, 25 Oct 2023 17:38:47 +0800 Message-ID: <20231025093847.3740104-4-zengheng4@huawei.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20231025093847.3740104-1-zengheng4@huawei.com> References: <20231025093847.3740104-1-zengheng4@huawei.com> MIME-Version: 1.0 Content-Transfer-Encoding: 7BIT Content-Type: text/plain; charset=US-ASCII X-Originating-IP: [10.175.103.91] X-ClientProxiedBy: dggems705-chm.china.huawei.com (10.3.19.182) To kwepemi500024.china.huawei.com (7.221.188.100) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-0.8 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Wed, 25 Oct 2023 02:34:06 -0700 (PDT) X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1780719497550768252 X-GMAIL-MSGID: 1780719497550768252 |
Series |
Make the cpuinfo_cur_freq interface read correctly
|
|
Commit Message
Zeng Heng
Oct. 25, 2023, 9:38 a.m. UTC
We have found significant differences in the latency of cpc_read() between
regular scenarios and scenarios with high memory access pressure. Ignoring
this error can result in getting rate interface occasionally returning
absurd values.
Here provides a high memory access sample test by stress-ng. My local
testing platform includes 160 CPUs, the CPC registers is accessed by mmio
method, and the cpuidle feature is disabled (the AMU always works online):
~~~
./stress-ng --memrate 160 --timeout 180
~~~
The following data is sourced from ftrace statistics towards
cppc_get_perf_ctrs():
Regular scenarios || High memory access pressure scenarios
104) | cppc_get_perf_ctrs() { || 133) | cppc_get_perf_ctrs() {
104) 0.800 us | cpc_read.isra.0(); || 133) 4.580 us | cpc_read.isra.0();
104) 0.640 us | cpc_read.isra.0(); || 133) 7.780 us | cpc_read.isra.0();
104) 0.450 us | cpc_read.isra.0(); || 133) 2.550 us | cpc_read.isra.0();
104) 0.430 us | cpc_read.isra.0(); || 133) 0.570 us | cpc_read.isra.0();
104) 4.610 us | } || 133) ! 157.610 us | }
104) | cppc_get_perf_ctrs() { || 133) | cppc_get_perf_ctrs() {
104) 0.720 us | cpc_read.isra.0(); || 133) 0.760 us | cpc_read.isra.0();
104) 0.720 us | cpc_read.isra.0(); || 133) 4.480 us | cpc_read.isra.0();
104) 0.510 us | cpc_read.isra.0(); || 133) 0.520 us | cpc_read.isra.0();
104) 0.500 us | cpc_read.isra.0(); || 133) + 10.100 us | cpc_read.isra.0();
104) 3.460 us | } || 133) ! 120.850 us | }
108) | cppc_get_perf_ctrs() { || 87) | cppc_get_perf_ctrs() {
108) 0.820 us | cpc_read.isra.0(); || 87) ! 255.200 us | cpc_read.isra.0();
108) 0.850 us | cpc_read.isra.0(); || 87) 2.910 us | cpc_read.isra.0();
108) 0.590 us | cpc_read.isra.0(); || 87) 5.160 us | cpc_read.isra.0();
108) 0.610 us | cpc_read.isra.0(); || 87) 4.340 us | cpc_read.isra.0();
108) 5.080 us | } || 87) ! 315.790 us | }
108) | cppc_get_perf_ctrs() { || 87) | cppc_get_perf_ctrs() {
108) 0.630 us | cpc_read.isra.0(); || 87) 0.800 us | cpc_read.isra.0();
108) 0.630 us | cpc_read.isra.0(); || 87) 6.310 us | cpc_read.isra.0();
108) 0.420 us | cpc_read.isra.0(); || 87) 1.190 us | cpc_read.isra.0();
108) 0.430 us | cpc_read.isra.0(); || 87) + 11.620 us | cpc_read.isra.0();
108) 3.780 us | } || 87) ! 207.010 us | }
My local testing platform works under 3000000hz, but the cpuinfo_cur_freq
interface returns values that are not even close to the actual frequency:
[root@localhost ~]# cd /sys/devices/system/cpu
[root@localhost cpu]# for i in {0..159}; do cat cpu$i/cpufreq/cpuinfo_cur_freq; done
5127812
2952127
3069001
3496183
922989768
2419194
3427042
2331869
3594611
8238499
...
The reason is when under heavy memory access pressure, the execution of
cpc_read() delay has increased from sub-microsecond to several hundred
microseconds. Moving the cpc_read function into a critical section by irq
disable/enable has minimal impact on the result.
cppc_get_perf_ctrs()[0] cppc_get_perf_ctrs()[1]
/ \ / \
cpc_read cpc_read cpc_read cpc_read
ref[0] delivered[0] ref[1] delivered[1]
| | | |
v v v v
-----------------------------------------------------------------------> time
<--delta[0]--> <------sample_period------> <-----delta[1]----->
Since that,
freq = ref_freq * (delivered[1] - delivered[0]) / (ref[1] - ref[0])
and
delivered[1] - delivered[0] = freq * (delta[1] + sample_period),
ref[1] - ref[0] = ref_freq * (delta[0] + sample_period)
To eliminate the impact of system memory access latency, setting a
sampling period of 2us is far from sufficient. Consequently, we suggest
cppc_cpufreq_get_rate() only can be called in the process context, and
adopt a longer sampling period to neutralize the impact of random latency.
Here we call the cond_resched() function instead of sleep-like functions
to ensure that `taskset -c $i cat cpu$i/cpufreq/cpuinfo_cur_freq` could
work when cpuidle feature is enabled.
Reported-by: Yang Shi <yang@os.amperecomputing.com>
Link: https://lore.kernel.org/all/20230328193846.8757-1-yang@os.amperecomputing.com/
Signed-off-by: Zeng Heng <zengheng4@huawei.com>
---
drivers/cpufreq/cppc_cpufreq.c | 16 +++++++++++++++-
1 file changed, 15 insertions(+), 1 deletion(-)
Comments
On Wed, Oct 25, 2023 at 05:38:47PM +0800, Zeng Heng wrote: > We have found significant differences in the latency of cpc_read() between > regular scenarios and scenarios with high memory access pressure. Ignoring > this error can result in getting rate interface occasionally returning > absurd values. > > Here provides a high memory access sample test by stress-ng. My local > testing platform includes 160 CPUs, the CPC registers is accessed by mmio > method, and the cpuidle feature is disabled (the AMU always works online): > > ~~~ > ./stress-ng --memrate 160 --timeout 180 > ~~~ > > The following data is sourced from ftrace statistics towards > cppc_get_perf_ctrs(): > > Regular scenarios || High memory access pressure scenarios > 104) | cppc_get_perf_ctrs() { || 133) | cppc_get_perf_ctrs() { > 104) 0.800 us | cpc_read.isra.0(); || 133) 4.580 us | cpc_read.isra.0(); > 104) 0.640 us | cpc_read.isra.0(); || 133) 7.780 us | cpc_read.isra.0(); > 104) 0.450 us | cpc_read.isra.0(); || 133) 2.550 us | cpc_read.isra.0(); > 104) 0.430 us | cpc_read.isra.0(); || 133) 0.570 us | cpc_read.isra.0(); > 104) 4.610 us | } || 133) ! 157.610 us | } > 104) | cppc_get_perf_ctrs() { || 133) | cppc_get_perf_ctrs() { > 104) 0.720 us | cpc_read.isra.0(); || 133) 0.760 us | cpc_read.isra.0(); > 104) 0.720 us | cpc_read.isra.0(); || 133) 4.480 us | cpc_read.isra.0(); > 104) 0.510 us | cpc_read.isra.0(); || 133) 0.520 us | cpc_read.isra.0(); > 104) 0.500 us | cpc_read.isra.0(); || 133) + 10.100 us | cpc_read.isra.0(); > 104) 3.460 us | } || 133) ! 120.850 us | } > 108) | cppc_get_perf_ctrs() { || 87) | cppc_get_perf_ctrs() { > 108) 0.820 us | cpc_read.isra.0(); || 87) ! 255.200 us | cpc_read.isra.0(); > 108) 0.850 us | cpc_read.isra.0(); || 87) 2.910 us | cpc_read.isra.0(); > 108) 0.590 us | cpc_read.isra.0(); || 87) 5.160 us | cpc_read.isra.0(); > 108) 0.610 us | cpc_read.isra.0(); || 87) 4.340 us | cpc_read.isra.0(); > 108) 5.080 us | } || 87) ! 315.790 us | } > 108) | cppc_get_perf_ctrs() { || 87) | cppc_get_perf_ctrs() { > 108) 0.630 us | cpc_read.isra.0(); || 87) 0.800 us | cpc_read.isra.0(); > 108) 0.630 us | cpc_read.isra.0(); || 87) 6.310 us | cpc_read.isra.0(); > 108) 0.420 us | cpc_read.isra.0(); || 87) 1.190 us | cpc_read.isra.0(); > 108) 0.430 us | cpc_read.isra.0(); || 87) + 11.620 us | cpc_read.isra.0(); > 108) 3.780 us | } || 87) ! 207.010 us | } > > My local testing platform works under 3000000hz, but the cpuinfo_cur_freq > interface returns values that are not even close to the actual frequency: > > [root@localhost ~]# cd /sys/devices/system/cpu > [root@localhost cpu]# for i in {0..159}; do cat cpu$i/cpufreq/cpuinfo_cur_freq; done > 5127812 > 2952127 > 3069001 > 3496183 > 922989768 > 2419194 > 3427042 > 2331869 > 3594611 > 8238499 > ... > > The reason is when under heavy memory access pressure, the execution of > cpc_read() delay has increased from sub-microsecond to several hundred > microseconds. Moving the cpc_read function into a critical section by irq > disable/enable has minimal impact on the result. > > cppc_get_perf_ctrs()[0] cppc_get_perf_ctrs()[1] > / \ / \ > cpc_read cpc_read cpc_read cpc_read > ref[0] delivered[0] ref[1] delivered[1] > | | | | > v v v v > -----------------------------------------------------------------------> time > <--delta[0]--> <------sample_period------> <-----delta[1]-----> > > Since that, > freq = ref_freq * (delivered[1] - delivered[0]) / (ref[1] - ref[0]) > and > delivered[1] - delivered[0] = freq * (delta[1] + sample_period), > ref[1] - ref[0] = ref_freq * (delta[0] + sample_period) > > To eliminate the impact of system memory access latency, setting a > sampling period of 2us is far from sufficient. Consequently, we suggest > cppc_cpufreq_get_rate() only can be called in the process context, and > adopt a longer sampling period to neutralize the impact of random latency. > > Here we call the cond_resched() function instead of sleep-like functions > to ensure that `taskset -c $i cat cpu$i/cpufreq/cpuinfo_cur_freq` could > work when cpuidle feature is enabled. > > Reported-by: Yang Shi <yang@os.amperecomputing.com> > Link: https://lore.kernel.org/all/20230328193846.8757-1-yang@os.amperecomputing.com/ > Signed-off-by: Zeng Heng <zengheng4@huawei.com> > --- > drivers/cpufreq/cppc_cpufreq.c | 16 +++++++++++++++- > 1 file changed, 15 insertions(+), 1 deletion(-) > > diff --git a/drivers/cpufreq/cppc_cpufreq.c b/drivers/cpufreq/cppc_cpufreq.c > index 321a9dc9484d..a7c5418bcda7 100644 > --- a/drivers/cpufreq/cppc_cpufreq.c > +++ b/drivers/cpufreq/cppc_cpufreq.c > @@ -851,12 +851,26 @@ static int cppc_get_perf_ctrs_pair(void *val) The previous patch added this function, and calls it with smp_call_on_cpu(), where it'll run in IRQ context with IRQs disabled... > struct fb_ctr_pair *fb_ctrs = val; > int cpu = fb_ctrs->cpu; > int ret; > + unsigned long timeout; > > ret = cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t0); > if (ret) > return ret; > > - udelay(2); /* 2usec delay between sampling */ > + if (likely(!irqs_disabled())) { > + /* > + * Set 1ms as sampling interval, but never schedule > + * to the idle task to prevent the AMU counters from > + * stopping working. > + */ > + timeout = jiffies + msecs_to_jiffies(1); > + while (!time_after(jiffies, timeout)) > + cond_resched(); > + > + } else { ... so we'll enter this branch of the if-else ... > + pr_warn_once("CPU%d: Get rate in atomic context", cpu); ... and pr_warn_once() for something that's apparently normal and outside of the user's control? That doesn't make much sense to me. Mark. > + udelay(2); /* 2usec delay between sampling */ > + } > > return cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t1); > } > -- > 2.25.1 >
在 2023/10/25 19:01, Mark Rutland 写道: > On Wed, Oct 25, 2023 at 05:38:47PM +0800, Zeng Heng wrote: > > The previous patch added this function, and calls it with smp_call_on_cpu(), > where it'll run in IRQ context with IRQs disabled... smp_call_on_cpu() puts the work to the bind-cpu worker. And this function will be called in task context, and IRQs is certainly enabled. Zeng Heng >> struct fb_ctr_pair *fb_ctrs = val; >> int cpu = fb_ctrs->cpu; >> int ret; >> + unsigned long timeout; >> >> ret = cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t0); >> if (ret) >> return ret; >> >> - udelay(2); /* 2usec delay between sampling */ >> + if (likely(!irqs_disabled())) { >> + /* >> + * Set 1ms as sampling interval, but never schedule >> + * to the idle task to prevent the AMU counters from >> + * stopping working. >> + */ >> + timeout = jiffies + msecs_to_jiffies(1); >> + while (!time_after(jiffies, timeout)) >> + cond_resched(); >> + >> + } else { > ... so we'll enter this branch of the if-else ... > >> + pr_warn_once("CPU%d: Get rate in atomic context", cpu); > ... and pr_warn_once() for something that's apparently normal and outside of > the user's control? > > That doesn't make much sense to me. > > Mark. > >> + udelay(2); /* 2usec delay between sampling */ >> + } >> >> return cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t1); >> } >> -- >> 2.25.1 >>
On Thu, Oct 26, 2023 at 09:55:39AM +0800, Zeng Heng wrote: > > 在 2023/10/25 19:01, Mark Rutland 写道: > > On Wed, Oct 25, 2023 at 05:38:47PM +0800, Zeng Heng wrote: > > > > The previous patch added this function, and calls it with smp_call_on_cpu(), > > where it'll run in IRQ context with IRQs disabled... > > smp_call_on_cpu() puts the work to the bind-cpu worker. Ah, sorry -- I had confused this with the smp_call_function*() family, which do this in IRQ context. > And this function will be called in task context, and IRQs is certainly enabled. Understood; given that, please ignore my comments below. Mark. > > > Zeng Heng > > > > struct fb_ctr_pair *fb_ctrs = val; > > > int cpu = fb_ctrs->cpu; > > > int ret; > > > + unsigned long timeout; > > > ret = cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t0); > > > if (ret) > > > return ret; > > > - udelay(2); /* 2usec delay between sampling */ > > > + if (likely(!irqs_disabled())) { > > > + /* > > > + * Set 1ms as sampling interval, but never schedule > > > + * to the idle task to prevent the AMU counters from > > > + * stopping working. > > > + */ > > > + timeout = jiffies + msecs_to_jiffies(1); > > > + while (!time_after(jiffies, timeout)) > > > + cond_resched(); > > > + > > > + } else { > > ... so we'll enter this branch of the if-else ... > > > > > + pr_warn_once("CPU%d: Get rate in atomic context", cpu); > > ... and pr_warn_once() for something that's apparently normal and outside of > > the user's control? > > > > That doesn't make much sense to me. > > > > Mark. > > > > > + udelay(2); /* 2usec delay between sampling */ > > > + } > > > return cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t1); > > > } > > > -- > > > 2.25.1 > > >
diff --git a/drivers/cpufreq/cppc_cpufreq.c b/drivers/cpufreq/cppc_cpufreq.c index 321a9dc9484d..a7c5418bcda7 100644 --- a/drivers/cpufreq/cppc_cpufreq.c +++ b/drivers/cpufreq/cppc_cpufreq.c @@ -851,12 +851,26 @@ static int cppc_get_perf_ctrs_pair(void *val) struct fb_ctr_pair *fb_ctrs = val; int cpu = fb_ctrs->cpu; int ret; + unsigned long timeout; ret = cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t0); if (ret) return ret; - udelay(2); /* 2usec delay between sampling */ + if (likely(!irqs_disabled())) { + /* + * Set 1ms as sampling interval, but never schedule + * to the idle task to prevent the AMU counters from + * stopping working. + */ + timeout = jiffies + msecs_to_jiffies(1); + while (!time_after(jiffies, timeout)) + cond_resched(); + + } else { + pr_warn_once("CPU%d: Get rate in atomic context", cpu); + udelay(2); /* 2usec delay between sampling */ + } return cppc_get_perf_ctrs(cpu, &fb_ctrs->fb_ctrs_t1); }