[v6,2/2] cpuidle: teo: Introduce util-awareness

  Modern interactive systems, such as recent Android phones, tend to have power
efficient shallow idle states. Selecting deeper idle states on a device while a
latency-sensitive workload is running can adversely impact performance due to
increased latency. Additionally, if the CPU wakes up from a deeper sleep before
its target residency as is often the case, it results in a waste of energy on
top of that.

At the moment, none of the available idle governors take any scheduling
information into account. They also tend to overestimate the idle
duration quite often, which causes them to select excessively deep idle
states, thus leading to increased wakeup latency and lower performance with no
power saving. For 'menu' while web browsing on Android for instance, those
types of wakeups ('too deep') account for over 24% of all wakeups.

At the same time, on some platforms idle state 0 can be power efficient
enough to warrant wanting to prefer it over idle state 1. This is because
the power usage of the two states can be so close that sufficient amounts
of too deep state 1 sleeps can completely offset the state 1 power saving to the
point where it would've been more power efficient to just use state 0 instead.
This is of course for systems where state 0 is not a polling state, such as
arm-based devices.

Sleeps that happened in state 0 while they could have used state 1 ('too shallow') only
save less power than they otherwise could have. Too deep sleeps, on the other
hand, harm performance and nullify the potential power saving from using state 1 in
the first place. While taking this into account, it is clear that on balance it
is preferable for an idle governor to have more too shallow sleeps instead of
more too deep sleeps on those kinds of platforms.

This patch specifically tunes TEO to prefer shallower idle states in
order to reduce wakeup latency and achieve better performance.
To this end, before selecting the next idle state it uses the avg_util signal
of a CPU's runqueue in order to determine to what extent the CPU is being utilized.
This util value is then compared to a threshold defined as a percentage of the
cpu's capacity (capacity >> 6 ie. ~1.5% in the current implementation). If the
util is above the threshold, the idle state selected by TEO metrics will be
reduced by 1, thus selecting a shallower state. If the util is below the threshold,
the governor defaults to the TEO metrics mechanism to try to select the deepest
available idle state based on the closest timer event and its own correctness.

The main goal of this is to reduce latency and increase performance for some
workloads. Under some workloads it will result in an increase in power usage
(Geekbench 5) while for other workloads it will also result in a decrease in
power usage compared to TEO (PCMark Web, Jankbench, Speedometer).

It can provide drastically decreased latency and performance benefits in certain
types of workloads that are sensitive to latency.

Example test results:

1. GB5 (better score, latency & more power usage)

| metric                                | menu           | teo               | teo-util-aware    |
| ------------------------------------- | -------------- | ----------------- | ----------------- |
| gmean score                           | 2826.5 (0.0%)  | 2764.8 (-2.18%)   | 2865 (1.36%)      |
| gmean power usage [mW]                | 2551.4 (0.0%)  | 2606.8 (2.17%)    | 2722.3 (6.7%)     |
| gmean too deep %                      | 14.99%         | 9.65%             | 4.02%             |
| gmean too shallow %                   | 2.5%           | 5.96%             | 14.59%            |
| gmean task wakeup latency (asynctask) | 78.16μs (0.0%) | 61.60μs (-21.19%) | 54.45μs (-30.34%) |

2. Jankbench (better score, latency & less power usage)

| metric                                | menu           | teo               | teo-util-aware    |
| ------------------------------------- | -------------- | ----------------- | ----------------- |
| gmean frame duration                  | 13.9 (0.0%)    | 14.7 (6.0%)       | 12.6 (-9.0%)      |
| gmean jank percentage                 | 1.5 (0.0%)     | 2.1 (36.99%)      | 1.3 (-17.37%)     |
| gmean power usage [mW]                | 144.6 (0.0%)   | 136.9 (-5.27%)    | 121.3 (-16.08%)   |
| gmean too deep %                      | 26.00%         | 11.00%            | 2.54%             |
| gmean too shallow %                   | 4.74%          | 11.89%            | 21.93%            |
| gmean wakeup latency (RenderThread)   | 139.5μs (0.0%) | 116.5μs (-16.49%) | 91.11μs (-34.7%)  |
| gmean wakeup latency (surfaceflinger) | 124.0μs (0.0%) | 151.9μs (22.47%)  | 87.65μs (-29.33%) |

Signed-off-by: Kajetan Puchalski <kajetan.puchalski@arm.com>
---
 drivers/cpuidle/governors/teo.c | 92 ++++++++++++++++++++++++++++++++-
 1 file changed, 91 insertions(+), 1 deletion(-)

Message ID	20230105145159.1089531-3-kajetan.puchalski@arm.com
State	New
Headers	Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:4e01:0:0:0:0:0 with SMTP id p1csp338491wrt; Thu, 5 Jan 2023 06:56:18 -0800 (PST) X-Google-Smtp-Source: AMrXdXtQRlm2lYLulYcK4yGmS5SCKs8CVpRu4AE8HYUdFMomLEmjN07Mdpxlse5C4dltb07LRLka X-Received: by 2002:a17:906:d217:b0:7af:1139:de77 with SMTP id w23-20020a170906d21700b007af1139de77mr44030519ejz.4.1672930578775; Thu, 05 Jan 2023 06:56:18 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1672930578; cv=none; d=google.com; s=arc-20160816; b=CidK9En+GSTnWrJt6y3RAty1hNa3NnRV/VjKGUM0GhFUtuwloneWPWFi0KuFTYjqTa O7boo+qszPSLh8mRzKfIYnxUkfV0F05EyL8JmAyxJL7LqD/usEPHxVwmJLZu0npuBGYp BOAGOsq+LITufOBZsZh60cMK1aNcMYC44t6j+uYyeR2QMZJtxKJYZPAbdVaVFbrNi35s BSS8zrbPTLfuyLaNXaFmV7p99h/010K6JqdXP7ke9DTEvTD7UKMJ6CP1i6oe3ZnwhiD3 zS/KGxAneVC+sZLSJMYbbamrMvvz5zGIfLbDcrKPEevXbZds//5Dg2KVCYKvagZ8RaR0 lGjA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from; bh=Vo4UUwHhWQgZprPQvb/0GsS3oy/u1aJfmsFx5DCjWqE=; b=rX6k+nXM4OthkLnHAOSCi2b5yPpJpZVJ3y/3YGTeEp/goYF5shxciDMbOFmuuknW+Y c9AMtEpetI2xzCuU9FrTPQlsk7kmNobx9LX4ya/De/a32E+eAUJTBUhMo0/+Qr6bSXz0 N7WgDjCjdDaa0S5iG3FGnPJQCHd2ZsS8oRNPRxPMHpEWkW324EtkyzgMYbfxAtxLhZVB 5390fM+2Yt+VDD5aE2zXmHstDlMxRfKHK/9REP1YR1WdIpERmZ1RE6zeVM9vmYtTncab l49aVGiAUUysEmnMqUaoX02TdpZ0qeYwJXQN7VWrCzoC5smXGk/P1c/fn/giSw2wiSCt FmkQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id j13-20020a50ed0d000000b0048bd3a68aa1si13576064eds.171.2023.01.05.06.55.53; Thu, 05 Jan 2023 06:56:18 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233312AbjAEOwy (ORCPT <rfc822;tmhikaru@gmail.com> + 99 others); Thu, 5 Jan 2023 09:52:54 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51252 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233416AbjAEOwW (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Thu, 5 Jan 2023 09:52:22 -0500 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 46BB610F1; Thu, 5 Jan 2023 06:52:21 -0800 (PST) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id D047216F3; Thu, 5 Jan 2023 06:53:02 -0800 (PST) Received: from e126311.arm.com (unknown [10.57.76.65]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 828593F71A; Thu, 5 Jan 2023 06:52:19 -0800 (PST) From: Kajetan Puchalski <kajetan.puchalski@arm.com> To: rafael@kernel.org Cc: daniel.lezcano@linaro.org, lukasz.luba@arm.com, Dietmar.Eggemann@arm.com, dsmythies@telus.net, yu.chen.surf@gmail.com, kajetan.puchalski@arm.com, linux-pm@vger.kernel.org, linux-kernel@vger.kernel.org Subject: [PATCH v6 2/2] cpuidle: teo: Introduce util-awareness Date: Thu, 5 Jan 2023 14:51:59 +0000 Message-Id: <20230105145159.1089531-3-kajetan.puchalski@arm.com> X-Mailer: git-send-email 2.37.1 In-Reply-To: <20230105145159.1089531-1-kajetan.puchalski@arm.com> References: <20230105145159.1089531-1-kajetan.puchalski@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1754194732103815236?= X-GMAIL-MSGID: =?utf-8?q?1754194854736378045?=
Series	cpuidle: teo: Introduce util-awareness \| [v6,0/2] cpuidle: teo: Introduce util-awareness [v6,1/2] cpuidle: teo: Optionally skip polling states in teo_find_shallower_state() [v6,2/2] cpuidle: teo: Introduce util-awareness

[v6,2/2] cpuidle: teo: Introduce util-awareness

Commit Message

Comments

Patch