[GIT,pull] timers/core for 6.4-rc1

  Linus,

please pull the latest timers/core branch from:

   git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git timers-core-2023-04-24

up to:  f7abf14f0001: posix-cpu-timers: Implement the missing timer_wait_running callback

Timers and timekeeping updates:

  - Improve the VDSO build time checks to cover all dynamic relocations

    VDSO does not allow dynamic relcations, but the build time check is
    incomplete and fragile.

    It's based on architectures specifying the relocation types to search
    for and does not handle R_*_NONE relocation entries correctly.
    R_*_NONE relocations are injected by some GNU ld variants if they fail
    to determine the exact .rel[a]/dyn_size to cover trailing zeros.
    R_*_NONE relocations must be ignored by dynamic loaders, so they
    should be ignored in the build time check too.

    Remove the architecture specific relocation types to check for and
    validate strictly that no other relocations than R_*_NONE end up
    in the VSDO .so file.

  - Prefer signal delivery to the current thread for
    CLOCK_PROCESS_CPUTIME_ID based posix-timers

    Such timers prefer to deliver the signal to the main thread of a
    process even if the context in which the timer expires is the current
    task. This has the downside that it might wake up an idle thread.

    As there is no requirement or guarantee that the signal has to be
    delivered to the main thread, avoid this by preferring the current
    task if it is part of the thread group which shares sighand.

    This not only avoids waking idle threads, it also distributes the
    signal delivery in case of multiple timers firing in the context
    of different threads close to each other better.

  - Align the tick period properly (again)

    For a long time the tick was starting at CLOCK_MONOTONIC zero, which
    allowed users space applications to either align with the tick or to
    place a periodic computation so that it does not interfere with the
    tick. The alignement of the tick period was more by chance than by
    intention as the tick is set up before a high resolution clocksource is
    installed, i.e. timekeeping is still tick based and the tick period
    advances from there.

    The early enablement of sched_clock() broke this alignement as the time
    accumulated by sched_clock() is taken into account when timekeeping is
    initialized. So the base value now(CLOCK_MONOTONIC) is not longer a
    multiple of tick periods, which breaks applications which relied on
    that behaviour.

    Cure this by aligning the tick starting point to the next multiple of
    tick periods, i.e 1000ms/CONFIG_HZ.

 - A set of NOHZ fixes and enhancements

   - Cure the concurrent writer race for idle and IO sleeptime statistics

     The statitic values which are exposed via /proc/stat are updated from
     the CPU local idle exit and remotely by cpufreq, but that happens
     without any form of serialization. As a consequence sleeptimes can be
     accounted twice or worse.

     Prevent this by restricting the accumulation writeback to the CPU
     local idle exit and let the remote access compute the accumulated
     value.

   - Protect idle/iowait sleep time with a sequence count

     Reading idle/iowait sleep time, e.g. from /proc/stat, can race with
     idle exit updates. As a consequence the readout may result in random
     and potentially going backwards values.

     Protect this by a sequence count, which fixes the idle time
     statistics issue, but cannot fix the iowait time problem because
     iowait time accounting races with remote wake ups decrementing the
     remote runqueues nr_iowait counter. The latter is impossible to fix,
     so the only way to deal with that is to document it properly and to
     remove the assertion in the selftest which triggers occasionally due
     to that.

   - Restructure struct tick_sched for better cache layout

   - Some small cleanups and a better cache layout for struct tick_sched

 - Implement the missing timer_wait_running() callback for POSIX CPU timers

   For unknown reason the introduction of the timer_wait_running() callback
   missed to fixup posix CPU timers, which went unnoticed for almost four
   years.

   While initially only targeted to prevent livelocks between a timer
   deletion and the timer expiry function on PREEMPT_RT enabled kernels, it
   turned out that fixing this for mainline is not as trivial as just
   implementing a stub similar to the hrtimer/timer callbacks.

   The reason is that for CONFIG_POSIX_CPU_TIMERS_TASK_WORK enabled systems
   there is a livelock issue independent of RT.

   CONFIG_POSIX_CPU_TIMERS_TASK_WORK=y moves the expiry of POSIX CPU timers
   out from hard interrupt context to task work, which is handled before
   returning to user space or to a VM. The expiry mechanism moves the
   expired timers to a stack local list head with sighand lock held. Once
   sighand is dropped the task can be preempted and a task which wants to
   delete a timer will spin-wait until the expiry task is scheduled back
   in. In the worst case this will end up in a livelock when the preempting
   task and the expiry task are pinned on the same CPU.

   The timer wheel has a timer_wait_running() mechanism for RT, which uses
   a per CPU timer-base expiry lock which is held by the expiry code and the
   task waiting for the timer function to complete blocks on that lock.

   This does not work in the same way for posix CPU timers as there is no
   timer base and expiry for process wide timers can run on any task
   belonging to that process, but the concept of waiting on an expiry lock
   can be used too in a slightly different way.

   Add a per task mutex to struct posix_cputimers_work, let the expiry task
   hold it accross the expiry function and let the deleting task which
   waits for the expiry to complete block on the mutex.

   In the non-contended case this results in an extra mutex_lock()/unlock()
   pair on both sides.

   This avoids spin-waiting on a task which is scheduled out, prevents the
   livelock and cures the problem for RT and !RT systems.

Thanks,

	tglx

------------------>
Dmitry Vyukov (2):
      posix-timers: Prefer delivery of signals to the current thread
      selftests/timers/posix_timers: Test delivery of signals across threads

Fangrui Song (1):
      vdso: Improve cmd_vdso_check to check all dynamic relocations

Frederic Weisbecker (8):
      timers/nohz: Restructure and reshuffle struct tick_sched
      timers/nohz: Only ever update sleeptime from idle exit
      timers/nohz: Protect idle/iowait sleep time under seqcount
      timers/nohz: Add a comment about broken iowait counter update race
      timers/nohz: Remove middle-function __tick_nohz_idle_stop_tick()
      MAINTAINERS: Remove stale email address
      selftests/proc: Remove idle time monotonicity assertions
      selftests/proc: Assert clock_gettime(CLOCK_BOOTTIME) VS /proc/uptime monotonicity

Sebastian Andrzej Siewior (1):
      tick/common: Align tick period with the HZ tick.

Thomas Gleixner (1):
      posix-cpu-timers: Implement the missing timer_wait_running callback

 MAINTAINERS                                    |   2 +-
 arch/arm/vdso/Makefile                         |   4 +-
 arch/arm64/kernel/vdso/Makefile                |   4 +-
 arch/arm64/kernel/vdso32/Makefile              |   3 -
 arch/csky/kernel/vdso/Makefile                 |   4 +-
 arch/loongarch/vdso/Makefile                   |   4 +-
 arch/mips/vdso/Makefile                        |   4 +-
 arch/powerpc/kernel/vdso/Makefile              |   2 +-
 arch/riscv/kernel/vdso/Makefile                |   4 +-
 arch/s390/kernel/vdso32/Makefile               |   3 +-
 arch/s390/kernel/vdso64/Makefile               |   3 +-
 arch/x86/entry/vdso/Makefile                   |   5 +-
 include/linux/posix-timers.h                   |  17 ++--
 kernel/signal.c                                |  21 +++-
 kernel/time/posix-cpu-timers.c                 |  81 ++++++++++++---
 kernel/time/posix-timers.c                     |   4 +
 kernel/time/tick-common.c                      |  12 ++-
 kernel/time/tick-sched.c                       | 135 ++++++++++++-------------
 kernel/time/tick-sched.h                       |  67 +++++++-----
 lib/vdso/Makefile                              |  13 +--
 tools/testing/selftests/proc/proc-uptime-001.c |  25 +++--
 tools/testing/selftests/proc/proc-uptime-002.c |  27 +++--
 tools/testing/selftests/proc/proc-uptime.h     |  28 ++---
 tools/testing/selftests/timers/posix_timers.c  |  77 ++++++++++++++
 24 files changed, 361 insertions(+), 188 deletions(-)

Message ID	168235969244.840202.3708265453324842162.tglx@xen13
State	New
Headers	Return-Path: <linux-kernel-owner@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:b0ea:0:b0:3b6:4342:cba0 with SMTP id b10csp2927821vqo; Mon, 24 Apr 2023 11:18:24 -0700 (PDT) X-Google-Smtp-Source: AKy350bKnC4cqff7Ic+I3KC/IwNRqKAKMa3YN0bD8U2pFmH2CjoSrBzQSjRBL5rGvPriniT6JvzC X-Received: by 2002:a17:90b:e90:b0:24b:8b80:ae7a with SMTP id fv16-20020a17090b0e9000b0024b8b80ae7amr8213478pjb.16.1682360304040; Mon, 24 Apr 2023 11:18:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1682360304; cv=none; d=google.com; s=arc-20160816; b=gPsxFLo7P0sjdC3kk/oVZruqCqd8dLsU9xs/M02nzPZIP4bKXbl457JLtoWhiNyoWf cvfVdEYh7ucRaiA4i7cGdlfrgGsduME3+mtCyu1gR9IWql1q6sYped4PZ6FFbkUZhBCL YuJRa34BrvqAUwnnIfMjnwDgVbbcpAGc+Eb0Dz8e6ngncEo2d9MM1+12gA0fxCgOJD2g FuUhE7PtumD1FjGNM/1zf57psV9NGPyIXr+QWebYQQ7oeNsUPWpUXo3MuvjIP33Sfiys nSD63qu5Xxb/gYgPIZKWpy450Wk4rjQSYCHLfah6grd4QVfhyuZ8ufIVk2qTxPimwf7P HfwA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:date:mime-version:content-transfer-encoding :message-id:references:subject:cc:to:dkim-signature:dkim-signature :from; bh=0ooCSzoFBdO2JRjHsdDpenYgzuyvCYluYx0ojm1MFM8=; b=WPXJ9zirLo3uosjkbz6XHrKC/jSlP+AixDMX8dhRV1N0t5RJs6uXbOwLLRJ9/gX0VM QbeeUtP/71WKMwMX2RSIP6aycfSyD/fkoe8Dxq2/owYC4CVRBAEQE1RyR8GbknxYMddR chdAcJZ1xVkYtmQsea6HwpetpmYR+NWlqKNb+3aqfbeb6XnCuH/URLHQhoYtg02of1gz 6EM+PGsbyXKewPPcsthtFvxauwlzasm5fd1ZVMWxFQmNWxx3d/3+wCpCE9+/O6MlzXGD 5/kWTKRmWno07iluUtDFuSLSxya4tzAZNTK0eehkHxpjt8MPMyMvDiznsnFRHE4I+VdY mfMQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=2pjpldFp; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l188-20020a6325c5000000b0050bf0f1b79csi12917775pgl.699.2023.04.24.11.18.11; Mon, 24 Apr 2023 11:18:24 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=2pjpldFp; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232176AbjDXSNr (ORCPT <rfc822;fengqi706@gmail.com> + 99 others); Mon, 24 Apr 2023 14:13:47 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51952 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232313AbjDXSNf (ORCPT <rfc822;linux-kernel@vger.kernel.org>); Mon, 24 Apr 2023 14:13:35 -0400 Received: from galois.linutronix.de (Galois.linutronix.de [IPv6:2a0a:51c0:0:12e:550::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2927459E7 for <linux-kernel@vger.kernel.org>; Mon, 24 Apr 2023 11:13:30 -0700 (PDT) From: Thomas Gleixner <tglx@linutronix.de> DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1682360008; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: references:references; bh=0ooCSzoFBdO2JRjHsdDpenYgzuyvCYluYx0ojm1MFM8=; b=2pjpldFpJXFMrO2r4VWkl6rV0rficHF6bcwY5io8i+h6BxoNcBZjFa5geFflRhvHWzBiXX 9BSFYVNeDhgKyFk/1iWkCws2yVxbB7pLB1u9wwnqkPCkmXWn1ymtMzb37oAoblCrh3plDA Ao5cNI/p5LlRL1jo/VAQRT+S10jMb0hYdjE1E8SKGPGBaApukbzksAr0XL407udf84120b NaoJJlwS9+7JwOYjHmW3FUTenl+/ARR7bIqBL0SiS3FDOih8gyEo98c4js/L/NaBHfS3qx Zptq9+aaOKQf0DgagbsH1NcxPk6OD1WNPrCRgAaiOZrba5RcAK2eocf7mMDFhg== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1682360008; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: references:references; bh=0ooCSzoFBdO2JRjHsdDpenYgzuyvCYluYx0ojm1MFM8=; b=gmEFNWDP7CD1Wj715ssjDqDTx2yb1scgtOM83+SsUaaIRBzXJCxVAifNaav1GxJtoWlJXm 8ijC1G/Y0m1Ag+Dg== To: Linus Torvalds <torvalds@linux-foundation.org> Cc: linux-kernel@vger.kernel.org, x86@kernel.org Subject: [GIT pull] timers/core for 6.4-rc1 References: <168235968801.840202.17752066425816055574.tglx@xen13> Message-ID: <168235969244.840202.3708265453324842162.tglx@xen13> Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 Date: Mon, 24 Apr 2023 20:13:28 +0200 (CEST) X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: <linux-kernel.vger.kernel.org> X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1764082638061922745?= X-GMAIL-MSGID: =?utf-8?q?1764082638061922745?=
Series	[GIT,pull] timers/core for 6.4-rc1 \| [GIT,pull] timers/core for 6.4-rc1

[GIT,pull] timers/core for 6.4-rc1

Commit Message

Comments

Patch