From patchwork Wed Dec 21 10:19:12 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Giuseppe Scrivano X-Patchwork-Id: 35326 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:adf:e747:0:0:0:0:0 with SMTP id c7csp3445385wrn; Wed, 21 Dec 2022 02:25:33 -0800 (PST) X-Google-Smtp-Source: AMrXdXtj8Nb3kLFconK30xc5wLhDhdzk1rDX8PZ9C7uEmoonTe5WWvYnHuRj2ijfDQD78vv50+mg X-Received: by 2002:a17:907:7e9f:b0:7c1:7d81:d2a8 with SMTP id qb31-20020a1709077e9f00b007c17d81d2a8mr1091359ejc.3.1671618332873; Wed, 21 Dec 2022 02:25:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1671618332; cv=none; d=google.com; s=arc-20160816; b=VFwEp/zE+u4sYmfbzlQU7Iv/WsHdjWsvJZ+ztAyAbRYBBPfAWVvNGkOAXG+D67A5C+ hsryleJsr2gK85g8B4pnv+0nLxujuMN0wh+7vEiVR5BmEZRXE6Uwc2AvgJLM+YKRGJ/V dYaXz8dtGZ6XGpmt81d92MqxD6QqYUM7TjIBDPVLeOJvedBt6s3dRlLbstjvxlNlXoWW dm9Rs6GNXp5vCjQWmyjircZ9NE1brCi2nKZYDSsbDM8Dj8Q9XOMu0ejav/lw6QFExzhe wO8euigs2kHub8/EpyGwCr8FGy3N/AHMOID8hAb2M8jsMw5Dg0hC6Wzc63JRwNYN3eLq EfVQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=rZMl35g9o3N5e4d8Ph65UmYi5ZElld+YK7ZJYgVoFlI=; b=aDjyWkCbxzDTE1Mr0J5ilsVS4VRKOKE48ayqX1oHyrBliotoGtpNN+ajMT8uGh9BR0 7Orv5h4NmnSkHw0ZnxKicEuonsHLFDBjUbJw1o7hQ6VBXUZu1TLZe+SxHCsVq/wNV5PN EYTR4bowTCa78JokE5Ad1dsGFcLomu5D5VVBABsN7fgGN82M9o2mkdB+oc0DSD8v4w+H S3y0jz1r1tj0DzJTWywAlMLgZ2WBmodQAMcPo75jvM3pL7SCscjn3QSkJl2YmOQZ7H8d YpjHgox+WgdUgmiGVgm1ER2luYqEqVW9TgTpXTyneNHpHDo/ajLj7rM+rrwndxmckDuy kerQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=O50+w2cM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id qf22-20020a1709077f1600b007c10276464fsi1737505ejc.24.2022.12.21.02.25.10; Wed, 21 Dec 2022 02:25:32 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=O50+w2cM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234712AbiLUKVP (ORCPT + 99 others); Wed, 21 Dec 2022 05:21:15 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43870 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229472AbiLUKVF (ORCPT ); Wed, 21 Dec 2022 05:21:05 -0500 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 07073120B2 for ; Wed, 21 Dec 2022 02:20:22 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1671618022; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding; bh=rZMl35g9o3N5e4d8Ph65UmYi5ZElld+YK7ZJYgVoFlI=; b=O50+w2cMlYFnZd4JZt1ueAFl/X6fnyRGxOuPknNeu7hcsq+/hcfKPM5Lqz9XHE/jFttc7s pnDwVanIGBz8Iz/19Tvmp5oBuYNR3yyu7BHgPRnhMqjp9ewx1I3LmVRFv77sOXJaGULItu iC7SUgGFNnMJ22qyhmk0vaLI816NRwU= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-25-hSYzwpRUPaGU4jKGJ7rZpw-1; Wed, 21 Dec 2022 05:20:17 -0500 X-MC-Unique: hSYzwpRUPaGU4jKGJ7rZpw-1 Received: from smtp.corp.redhat.com (int-mx07.intmail.prod.int.rdu2.redhat.com [10.11.54.7]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 7C9F0802D19; Wed, 21 Dec 2022 10:20:16 +0000 (UTC) Received: from lithium.redhat.com (unknown [10.39.193.218]) by smtp.corp.redhat.com (Postfix) with ESMTP id CBE311400E44; Wed, 21 Dec 2022 10:20:14 +0000 (UTC) From: Giuseppe Scrivano To: linux-kernel@vger.kernel.org Cc: ebiederm@xmission.com, brauner@kernel.org, cyphar@cyphar.com, viro@zeniv.linux.org.uk, alexl@redhat.com, peterz@infradead.org, gscrivan@redhat.com Subject: [PATCH RFC 1/2] exec: add PR_HIDE_SELF_EXE prctl Date: Wed, 21 Dec 2022 11:19:12 +0100 Message-Id: <20221221101913.484203-1-gscrivan@redhat.com> MIME-Version: 1.0 X-Scanned-By: MIMEDefang 3.1 on 10.11.54.7 X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1752818865262750356?= X-GMAIL-MSGID: =?utf-8?q?1752818865262750356?= This patch adds a new prctl called PR_HIDE_SELF_EXE which allows processes to hide their own /proc/*/exe file. When this prctl is used, every access to /proc/*/exe for the calling process will fail with ENOENT. This is useful for preventing issues like CVE-2019-5736, where an attacker can gain host root access by overwriting the binary in OCI runtimes through file-descriptor mishandling in containers. The current fix for CVE-2019-5736 is to create a read-only copy or a bind-mount of the current executable, and then re-exec the current process. With the new prctl, the read-only copy or bind-mount copy is not needed anymore. Signed-off-by: Giuseppe Scrivano --- fs/exec.c | 1 + fs/proc/base.c | 8 +++++--- include/linux/sched.h | 5 +++++ include/uapi/linux/prctl.h | 3 +++ kernel/sys.c | 9 +++++++++ tools/include/uapi/linux/prctl.h | 3 +++ 6 files changed, 26 insertions(+), 3 deletions(-) diff --git a/fs/exec.c b/fs/exec.c index ab913243a367..5a5dd964c3a3 100644 --- a/fs/exec.c +++ b/fs/exec.c @@ -1855,6 +1855,7 @@ static int bprm_execve(struct linux_binprm *bprm, /* execve succeeded */ current->fs->in_exec = 0; current->in_execve = 0; + task_clear_hide_self_exe(current); rseq_execve(current); acct_update_integrals(current); task_numa_free(current, false); diff --git a/fs/proc/base.c b/fs/proc/base.c index 9e479d7d202b..959968e2da0d 100644 --- a/fs/proc/base.c +++ b/fs/proc/base.c @@ -1723,19 +1723,21 @@ static int proc_exe_link(struct dentry *dentry, struct path *exe_path) { struct task_struct *task; struct file *exe_file; + long hide_self_exe; task = get_proc_task(d_inode(dentry)); if (!task) return -ENOENT; exe_file = get_task_exe_file(task); + hide_self_exe = task_hide_self_exe(task); put_task_struct(task); - if (exe_file) { + if (exe_file && !hide_self_exe) { *exe_path = exe_file->f_path; path_get(&exe_file->f_path); fput(exe_file); return 0; - } else - return -ENOENT; + } + return -ENOENT; } static const char *proc_pid_get_link(struct dentry *dentry, diff --git a/include/linux/sched.h b/include/linux/sched.h index 853d08f7562b..8db32d5fc285 100644 --- a/include/linux/sched.h +++ b/include/linux/sched.h @@ -1790,6 +1790,7 @@ static __always_inline bool is_percpu_thread(void) #define PFA_SPEC_IB_DISABLE 5 /* Indirect branch speculation restricted */ #define PFA_SPEC_IB_FORCE_DISABLE 6 /* Indirect branch speculation permanently restricted */ #define PFA_SPEC_SSB_NOEXEC 7 /* Speculative Store Bypass clear on execve() */ +#define PFA_HIDE_SELF_EXE 8 /* Hide /proc/self/exe for the process */ #define TASK_PFA_TEST(name, func) \ static inline bool task_##func(struct task_struct *p) \ @@ -1832,6 +1833,10 @@ TASK_PFA_CLEAR(SPEC_IB_DISABLE, spec_ib_disable) TASK_PFA_TEST(SPEC_IB_FORCE_DISABLE, spec_ib_force_disable) TASK_PFA_SET(SPEC_IB_FORCE_DISABLE, spec_ib_force_disable) +TASK_PFA_TEST(HIDE_SELF_EXE, hide_self_exe) +TASK_PFA_SET(HIDE_SELF_EXE, hide_self_exe) +TASK_PFA_CLEAR(HIDE_SELF_EXE, hide_self_exe) + static inline void current_restore_flags(unsigned long orig_flags, unsigned long flags) { diff --git a/include/uapi/linux/prctl.h b/include/uapi/linux/prctl.h index a5e06dcbba13..f12f3df12468 100644 --- a/include/uapi/linux/prctl.h +++ b/include/uapi/linux/prctl.h @@ -284,4 +284,7 @@ struct prctl_mm_map { #define PR_SET_VMA 0x53564d41 # define PR_SET_VMA_ANON_NAME 0 +#define PR_SET_HIDE_SELF_EXE 65 +#define PR_GET_HIDE_SELF_EXE 66 + #endif /* _LINUX_PRCTL_H */ diff --git a/kernel/sys.c b/kernel/sys.c index 5fd54bf0e886..e992f1b72973 100644 --- a/kernel/sys.c +++ b/kernel/sys.c @@ -2626,6 +2626,15 @@ SYSCALL_DEFINE5(prctl, int, option, unsigned long, arg2, unsigned long, arg3, case PR_SET_VMA: error = prctl_set_vma(arg2, arg3, arg4, arg5); break; + case PR_SET_HIDE_SELF_EXE: + if (arg2 != 1 || arg3 || arg4 || arg5) + return -EINVAL; + task_set_hide_self_exe(current); + break; + case PR_GET_HIDE_SELF_EXE: + if (arg2 || arg3 || arg4 || arg5) + return -EINVAL; + return task_hide_self_exe(current) ? 1 : 0; default: error = -EINVAL; break; diff --git a/tools/include/uapi/linux/prctl.h b/tools/include/uapi/linux/prctl.h index a5e06dcbba13..f12f3df12468 100644 --- a/tools/include/uapi/linux/prctl.h +++ b/tools/include/uapi/linux/prctl.h @@ -284,4 +284,7 @@ struct prctl_mm_map { #define PR_SET_VMA 0x53564d41 # define PR_SET_VMA_ANON_NAME 0 +#define PR_SET_HIDE_SELF_EXE 65 +#define PR_GET_HIDE_SELF_EXE 66 + #endif /* _LINUX_PRCTL_H */