From patchwork Thu Oct 27 02:53:00 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Gang Li X-Patchwork-Id: 751 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:6687:0:0:0:0:0 with SMTP id l7csp605681wru; Wed, 26 Oct 2022 19:57:24 -0700 (PDT) X-Google-Smtp-Source: AMsMyM6B+ioi68xJmj7JXwW41Swj7uFiQo/nrhM4O8nC9f5fIz5vZ1e48SoDnNcbkAeV0fZL7LNj X-Received: by 2002:a17:907:3e85:b0:73d:60cc:5d06 with SMTP id hs5-20020a1709073e8500b0073d60cc5d06mr41390038ejc.722.1666839443959; Wed, 26 Oct 2022 19:57:23 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1666839443; cv=none; d=google.com; s=arc-20160816; b=KhuOjIErT2G77CbS8AY6F9kIF5CySsZVfbhDL8AGwN/6yjxwREpLkokrXYwwstKD2f HK6w9pxzZzNUj7hdVh9XCegwwtnlUj+eRMuMSROiXoDgqiu+Qy8mztmKF4aZxA4fEe79 c8A2AdRb5S3gzOKhzS3watdpN+3WIOvf+RAgl2l0yhsFTycaI0oOu6+fz1T3nxpd7fqY 7EgJN3DgQJR+/smMK8OoXVK7AV4lLsPHJJ69XFAY2hXysgiRzfWp7mom9x5XBsgMjP78 kuijrN6y1O5bHaMfmd7eIi92z6DgztW6jEtdVyVVJqK0zxl4U5ptsdLskRqbVsq8PA6G Q3kQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:to:content-transfer-encoding:mime-version :message-id:date:subject:cc:from:dkim-signature; bh=vQp+rTjOO9UdjsGEvv6ixHqu9x9F8NVITMJFC8Rdz0Y=; b=JPOouAqkpPs3XKRgU8wCQjy+feSOJscaizS8mVKM5SIaoT5sX1w4VShFTxJMqTWOb7 +hywdL3HBF5beEQbrogzZCnMY+kNCnfHUa1P0ocawphHdzctnlvpLQ+N2eRKxk7len/P cOW5mDd3VNCJmPLa6HX92aritFiuBd9km7wyune+zp/o5o/6QTevYIkpMhOs0+yVz64o /zAXcUXgAgJpG0G3EvRVoGEpYRVOcS49U1hzc3Y0AAKJ/l0dOOrUgtwuDvKfn8G42oMH v4dLNNdT1vfqqjIv7RcbcUA+HWz943Vlnr6hfxvzffnJ+h4gC72baeh3AUtH5SkqQAYl fljA== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@bytedance-com.20210112.gappssmtp.com header.s=20210112 header.b=Bx8Zq1Gh; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=bytedance.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id qx26-20020a170906fcda00b0073c0169863dsi176847ejb.465.2022.10.26.19.56.59; Wed, 26 Oct 2022 19:57:23 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=fail header.i=@bytedance-com.20210112.gappssmtp.com header.s=20210112 header.b=Bx8Zq1Gh; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=bytedance.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233963AbiJ0CyA (ORCPT + 99 others); Wed, 26 Oct 2022 22:54:00 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55318 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233941AbiJ0Cx4 (ORCPT ); Wed, 26 Oct 2022 22:53:56 -0400 Received: from mail-pl1-x633.google.com (mail-pl1-x633.google.com [IPv6:2607:f8b0:4864:20::633]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3170812743D for ; Wed, 26 Oct 2022 19:53:54 -0700 (PDT) Received: by mail-pl1-x633.google.com with SMTP id p3so38441pld.10 for ; Wed, 26 Oct 2022 19:53:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance-com.20210112.gappssmtp.com; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:from:to:cc:subject:date:message-id:reply-to; bh=vQp+rTjOO9UdjsGEvv6ixHqu9x9F8NVITMJFC8Rdz0Y=; b=Bx8Zq1GhyzQTw6V+JmbgzcD8OPYXvIJdoKfXdrU0+qUxDAcHN2MaXNbgRqLENV+7CL z61wzbjcJkyAnGsRD1OSVnsBuW74Phb1fMCx/4ZaV/bT8zkG11bu6cC9cBEZApf+sih4 hDJPX93JP1wqq+kmzWvvfB6jDM9YaoELP+9O8lbsfT+wpSskve4IsaTf0w1/0ZYLNR+8 mRFMEiZ3i7Iw/eFLGPp3su85U3CTFBbESVq64jQstkgHdIXNRlFksGBnrosfUdOBJV/D kAquuGwj/bce5QFzqR4uxfxHa/858FLI6/n53CLBBWz2Lx1JFsV9uiPHuc7aSnRtEbpe HBTw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:subject:cc :to:from:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=vQp+rTjOO9UdjsGEvv6ixHqu9x9F8NVITMJFC8Rdz0Y=; b=q/MPxqZ1srQoXvx7Dl8b1FVKBWum7eg3aTDQ+ou+FB23GxZtcV992bALkjw2OEi+hO KVVow9bvA6fwpU9I13JnCHA0Drpl4w3Lsb1z6fJapf75Vxp+uC0AQ812UzK1/F4ZQpqs +s8ZHig8741nejC9Z5H7/IQETLj2i/Dqc2A9HLC6d30aNDPTcclzQiMpW4UySB2ruFLE EE3yBue28Y7deMhfETFCHlevyWvIybeWOSVCRy5LLEimySSoPv+mSCAzE21ecocNoDxh lRHjjEinRcY023Z2utPsPqNbuJpw/gxVNk1VNZxOgYbkT/lHnSadD0F7vtsUd5LgESds /uZA== X-Gm-Message-State: ACrzQf1YaRBylpALl1odxSAw0wbeAGuj/fB5AfAXV4xYiD0yWCzdP1wQ E3obaT1TzA7ETV8Mlx0Vk7JaTfapWJL1dQ== X-Received: by 2002:a17:90b:254e:b0:20b:7e26:f0a0 with SMTP id nw14-20020a17090b254e00b0020b7e26f0a0mr7448390pjb.203.1666839234270; Wed, 26 Oct 2022 19:53:54 -0700 (PDT) Received: from C02FT5A6MD6R.bytedance.net ([61.120.150.75]) by smtp.gmail.com with ESMTPSA id o1-20020aa79781000000b0056b6a22d6c9sm79783pfp.212.2022.10.26.19.53.51 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 26 Oct 2022 19:53:53 -0700 (PDT) From: Gang Li Cc: Gang Li , linux-api@vger.kernel.org, linux-kernel@vger.kernel.org Subject: [PATCH v5 0/2] sched/numa: add per-process numa_balancing Date: Thu, 27 Oct 2022 10:53:00 +0800 Message-Id: <20221027025302.45766-1-ligang.bdlg@bytedance.com> X-Mailer: git-send-email 2.32.0 MIME-Version: 1.0 X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net To: unlisted-recipients:; (no To-header on input) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1747807836746776266?= X-GMAIL-MSGID: =?utf-8?q?1747807836746776266?= # Introduce Add PR_NUMA_BALANCING in prctl. A large number of page faults will cause performance loss when numa balancing is performing. Thus those processes which care about worst-case performance need numa balancing disabled. Others, on the contrary, allow a temporary performance loss in exchange for higher average performance, so enable numa balancing is better for them. Numa balancing can only be controlled globally by /proc/sys/kernel/numa_balancing. Due to the above case, we want to disable/enable numa_balancing per-process instead. Set per-process numa balancing: prctl(PR_NUMA_BALANCING, PR_SET_NUMA_BALANCING_DISABLE); //disable prctl(PR_NUMA_BALANCING, PR_SET_NUMA_BALANCING_ENABLE); //enable prctl(PR_NUMA_BALANCING, PR_SET_NUMA_BALANCING_DEFAULT); //follow global Get numa_balancing state: prctl(PR_NUMA_BALANCING, PR_GET_NUMA_BALANCING, &ret); cat /proc//status | grep NumaB_mode # Unixbench multithread result I ran benchmark 20 times, but still have measurement error. I will run benchmark more precisely on the next version of this patchset. +-------------------+----------+ | NAME | OVERHEAD | +-------------------+----------+ | Dhrystone2 | -0.27% | | Whetstone | -0.17% | | Execl | -0.92% | | File_Copy_1024 | 0.31% | | File_Copy_256 | -1.96% | | File_Copy_4096 | 0.40% | | Pipe_Throughput | -3.08% | | Context_Switching | -1.11% | | Process_Creation | 3.24% | | Shell_Scripts_1 | 0.26% | | Shell_Scripts_8 | 0.32% | | System_Call | 0.10% | +-------------------+----------+ | Total | -0.21% | +-------------------+----------+ # Changes Changes in v5: - replace numab_enabled with numa_balancing_mode (Peter Zijlstra) - make numa_balancing_enabled and numa_balancing_mode inline (Peter Zijlstra) - use static_branch_inc/dec instead of static_branch_enable/disable (Peter Zijlstra) - delete CONFIG_NUMA_BALANCING in task_tick_fair (Peter Zijlstra) - reword commit, use imperative mood (Bagas Sanjaya) - Unixbench overhead result Changes in v4: - code clean: add wrapper function `numa_balancing_enabled` Changes in v3: - Fix compile error. Changes in v2: - Now PR_NUMA_BALANCING support three states: enabled, disabled, default. enabled and disabled will ignore global setting, and default will follow global setting. Gang Li (2): sched/numa: use static_branch_inc/dec for sched_numa_balancing sched/numa: add per-process numa_balancing Documentation/filesystems/proc.rst | 2 ++ fs/proc/task_mmu.c | 20 ++++++++++++ include/linux/mm_types.h | 3 ++ include/linux/sched/numa_balancing.h | 45 ++++++++++++++++++++++++++ include/uapi/linux/prctl.h | 7 +++++ kernel/fork.c | 4 +++ kernel/sched/core.c | 26 +++++++-------- kernel/sched/fair.c | 9 +++--- kernel/sys.c | 47 ++++++++++++++++++++++++++++ mm/mprotect.c | 6 ++-- 10 files changed, 150 insertions(+), 19 deletions(-) Acked-by: John Hubbard