Message ID | 20240212213922.783301-11-surenb@google.com |
---|---|
State | New |
Headers |
Return-Path: <linux-kernel+bounces-62416-ouuuleilei=gmail.com@vger.kernel.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a05:7300:bc8a:b0:106:860b:bbdd with SMTP id dn10csp183210dyb; Mon, 12 Feb 2024 13:44:55 -0800 (PST) X-Forwarded-Encrypted: i=3; AJvYcCXHbu7168pRe6+f91OyKhv8OeIj2RMKLyXjoBgLrBgDDjfILF9YlwFwW9zDz5B4EblnSKYdSBYpuzCZjGbpqiGgN+wy1A== X-Google-Smtp-Source: AGHT+IHOGn5inqoykFL/rCSCoilracJ0tXWdpUouTAwrx9gJMq59oToSTbDPGYmCMGPNqJkau9RB X-Received: by 2002:a17:906:4a8f:b0:a3c:dcef:8e96 with SMTP id x15-20020a1709064a8f00b00a3cdcef8e96mr1243179eju.27.1707774294987; Mon, 12 Feb 2024 13:44:54 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1707774294; cv=pass; d=google.com; s=arc-20160816; b=CERdpRLGDod+kO6gH4I0gfZAGUit2ov+ER7uB9tk+B+UgcbwFRXAJou0rPwrcz62TO lxv0ZP7SY6l1WO5FmyVeDn8R/VsF9d0RpRI6SuPgOqoKw9iVnIHCraTR1KXI5y8Ieav+ c2Yi51EC1iHC4toKKkmNT5RuLN83rvfKYRmCrFWO+9MD676+Mz2+P0Ncf4zjQ7Wc86OJ RH3qpI/0wz7xwzwwbladk4kOvBNpCuLYAhFQrtvV0AyJafK6V9e1UfiIq9ErMT0nnN3b JdV18+iRGvlsxqpqwMsjg28rtBTEywWzaXzDCD9cjnaNlHDkgibvEdnVIDhe28Ucx1Fk MuTQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=cc:to:from:subject:message-id:references:mime-version :list-unsubscribe:list-subscribe:list-id:precedence:in-reply-to:date :dkim-signature; bh=3+XWH7AoCI6Q7n29IBIT816HNKsBqFH+UYcKnJYD23I=; fh=abM2omETkk2bjr43tWlxndfNjpSUyFYYgZQhCZufsKY=; b=zbERfK4hDdW1njHY5gu/IIq7c5ZRshpWyeLRMWAarXJKAC+XEdyPy5JGXfcM2Q1sUG H+SE/3s4k3JM9I26fIah68Sg8TwqN+ITUAe3EgSNtFfwQBk1KcdlR3NcMmGX0kga6FN6 bWwGsDa31D7KdJByPuWMjkOdzbEt8KzMWosEx4SCQMgnfEKFpiYyFoTzKkb//UNEdh+e 9xSzfDxYpBLbqtJpbYJ2FDlqBLEUfh7iSuykVQUTketYVH+ZKcFjt1GTIi1cD70n070m 0dp/M9QMK7sKs/VOrWxhX7zVlcm8GFX2h51gg+N+herzk1AF80Fn7J0V+V4GDp1YQv7I JcGg==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b=i9BCYESm; arc=pass (i=1 spf=pass spfdomain=flex--surenb.bounces.google.com dkim=pass dkdomain=google.com dmarc=pass fromdomain=google.com); spf=pass (google.com: domain of linux-kernel+bounces-62416-ouuuleilei=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-62416-ouuuleilei=gmail.com@vger.kernel.org"; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com X-Forwarded-Encrypted: i=2; AJvYcCWvxujOvZ8oRutBjFt3Lr5XU0slKj3qQVTXb9+rMzRPeYQ1nRRQylJtWvlfQtDHGINGQ4hUn+56tLHH8J9quvE/ddrdYQ== Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [2604:1380:4601:e00::3]) by mx.google.com with ESMTPS id y20-20020a1709060a9400b00a3cf73f2a22si81302ejf.125.2024.02.12.13.44.54 for <ouuuleilei@gmail.com> (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 12 Feb 2024 13:44:54 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel+bounces-62416-ouuuleilei=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) client-ip=2604:1380:4601:e00::3; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b=i9BCYESm; arc=pass (i=1 spf=pass spfdomain=flex--surenb.bounces.google.com dkim=pass dkdomain=google.com dmarc=pass fromdomain=google.com); spf=pass (google.com: domain of linux-kernel+bounces-62416-ouuuleilei=gmail.com@vger.kernel.org designates 2604:1380:4601:e00::3 as permitted sender) smtp.mailfrom="linux-kernel+bounces-62416-ouuuleilei=gmail.com@vger.kernel.org"; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id 6B4811F23B1C for <ouuuleilei@gmail.com>; Mon, 12 Feb 2024 21:44:54 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 5F34254679; Mon, 12 Feb 2024 21:40:03 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b="i9BCYESm" Received: from mail-yb1-f202.google.com (mail-yb1-f202.google.com [209.85.219.202]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 5EE3353381 for <linux-kernel@vger.kernel.org>; Mon, 12 Feb 2024 21:39:55 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.219.202 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1707773998; cv=none; b=DDD5G7rcfd67fI8qMkLa/2+9BqHiJkVNSThyhskkdQpWHrsWViSJ3n6v1JZuDFrU+b6CGkhY21d8OtyDWDFFWgQ4G9e1cOwrt6k1mWIcs2akBepNdsXu6BZ96oF4EPifQX0z37bYNI/Rhj+/x4jSWOVIp0x7oFGwhM9wjFvNLVQ= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1707773998; c=relaxed/simple; bh=yw35aaReXg/k0dyxudJNlHkPhP6T+BxCJ8bIFrcG/Z8=; h=Date:In-Reply-To:Mime-Version:References:Message-ID:Subject:From: To:Cc:Content-Type; b=WrdsbOZj1kcyIkY2n0SSwDKL8JmTrcR5TITwc4Hgb9mJpTO1EWEUY8poWC9zrQGlobcWEvl2EEwYBOPGUOeuOi0AU6igX8mQI1WZMEALcTiX79exX/23Zm4/qs9PPXP71FMo9zOZ5kD+neffjaCWI03rBFf9oPRd5EFBSU3h6vw= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=google.com; spf=pass smtp.mailfrom=flex--surenb.bounces.google.com; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b=i9BCYESm; arc=none smtp.client-ip=209.85.219.202 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=google.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=flex--surenb.bounces.google.com Received: by mail-yb1-f202.google.com with SMTP id 3f1490d57ef6-dc6ade10cb8so8232858276.0 for <linux-kernel@vger.kernel.org>; Mon, 12 Feb 2024 13:39:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1707773994; x=1708378794; darn=vger.kernel.org; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=3+XWH7AoCI6Q7n29IBIT816HNKsBqFH+UYcKnJYD23I=; b=i9BCYESmpnjSlCIWJ49ZCl2O4gjAywPJHD7X5oLS86RMEYyVitujiuJ7jwPMHaoqxz r3YRM2vbpJeTc53D0cglOuGSQl4E2QQzCjb/WSLRPCvo9apDvn4uMwP87zDQmgd65IUj AwnAK/y6515WxexhuQgCXWZUfJW5EBBKI2I2roHuWMkNcHE5hGodmszSw+0+n545/wnT jvaRZ2mXZo5e92PfQpitIrGY2QfhYDAjOBkY+OGjwO96shiqlAmlN5gY/jIooMl+/97M /sIzlFzuV2u0Rx8qFZ3/EaA3d4qUY5r3vlKetNAGZoul50gd8H0It2b+Znhf+MjxqSSQ sT/A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1707773994; x=1708378794; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=3+XWH7AoCI6Q7n29IBIT816HNKsBqFH+UYcKnJYD23I=; b=wvB6avij7/qa8A2NNmF0Mt09Wpf0hn5JvQji9JcwOuaqkr8WWsLDG1Pcmy0gq5S0ww LnLsXdA2n5vH//q1WqkYYlOLf5HdvJB/mVdEm2cly2p+Enlzb8zNKQfEciJPV7mmxMeu 5BZJZFL1fdG1ofjXMDBGRCae+6CYiRDPWrb3qNTRQxheKIOW5yzGbhPyYKSy+6eXA5id pvcVicIam2XSF8IYrzv+xvaz6joxxLUqC8pIqHO5Su7kwkgkMme3kGbUUJry64mxvYxF e2EwhpeEQyngCoecFv6iUSspNUDVwvhMOPWXJvipFKyS1Mt7SUMKmdrKFQwoNjrS6eFU h98Q== X-Forwarded-Encrypted: i=1; AJvYcCUCKiQqpEIZt8UW038wEhSxOWOUYvENfYxp1YjYdlbHFtcwte3+AgGMjfp3j3blCwc7woLdRCXOql7zjRqTWFqVY4e8nuTj6+3I38OZ X-Gm-Message-State: AOJu0YxCjWKVDqpMuwsftSscVLg/hknmr1E8NqG90saigdYE62S6j3G2 wQSqcJ8LcZRniNX5kk4Xy3By1Pp0PpKOz+QrYh3AL3cHEXGEGVOR8irGsKh6OK13yRXs559fS6I m8g== X-Received: from surenb-desktop.mtv.corp.google.com ([2620:15c:211:201:b848:2b3f:be49:9cbc]) (user=surenb job=sendgmr) by 2002:a25:2614:0:b0:dc6:b7c2:176e with SMTP id m20-20020a252614000000b00dc6b7c2176emr130871ybm.4.1707773994229; Mon, 12 Feb 2024 13:39:54 -0800 (PST) Date: Mon, 12 Feb 2024 13:38:56 -0800 In-Reply-To: <20240212213922.783301-1-surenb@google.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: <linux-kernel.vger.kernel.org> List-Subscribe: <mailto:linux-kernel+subscribe@vger.kernel.org> List-Unsubscribe: <mailto:linux-kernel+unsubscribe@vger.kernel.org> Mime-Version: 1.0 References: <20240212213922.783301-1-surenb@google.com> X-Mailer: git-send-email 2.43.0.687.g38aa6559b0-goog Message-ID: <20240212213922.783301-11-surenb@google.com> Subject: [PATCH v3 10/35] lib: code tagging framework From: Suren Baghdasaryan <surenb@google.com> To: akpm@linux-foundation.org Cc: kent.overstreet@linux.dev, mhocko@suse.com, vbabka@suse.cz, hannes@cmpxchg.org, roman.gushchin@linux.dev, mgorman@suse.de, dave@stgolabs.net, willy@infradead.org, liam.howlett@oracle.com, corbet@lwn.net, void@manifault.com, peterz@infradead.org, juri.lelli@redhat.com, catalin.marinas@arm.com, will@kernel.org, arnd@arndb.de, tglx@linutronix.de, mingo@redhat.com, dave.hansen@linux.intel.com, x86@kernel.org, peterx@redhat.com, david@redhat.com, axboe@kernel.dk, mcgrof@kernel.org, masahiroy@kernel.org, nathan@kernel.org, dennis@kernel.org, tj@kernel.org, muchun.song@linux.dev, rppt@kernel.org, paulmck@kernel.org, pasha.tatashin@soleen.com, yosryahmed@google.com, yuzhao@google.com, dhowells@redhat.com, hughd@google.com, andreyknvl@gmail.com, keescook@chromium.org, ndesaulniers@google.com, vvvvvv@google.com, gregkh@linuxfoundation.org, ebiggers@google.com, ytcoode@gmail.com, vincent.guittot@linaro.org, dietmar.eggemann@arm.com, rostedt@goodmis.org, bsegall@google.com, bristot@redhat.com, vschneid@redhat.com, cl@linux.com, penberg@kernel.org, iamjoonsoo.kim@lge.com, 42.hyeyoo@gmail.com, glider@google.com, elver@google.com, dvyukov@google.com, shakeelb@google.com, songmuchun@bytedance.com, jbaron@akamai.com, rientjes@google.com, minchan@google.com, kaleshsingh@google.com, surenb@google.com, kernel-team@android.com, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, iommu@lists.linux.dev, linux-arch@vger.kernel.org, linux-fsdevel@vger.kernel.org, linux-mm@kvack.org, linux-modules@vger.kernel.org, kasan-dev@googlegroups.com, cgroups@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1790731139184513057 X-GMAIL-MSGID: 1790731139184513057 |
Series |
Memory allocation profiling
|
|
Commit Message
Suren Baghdasaryan
Feb. 12, 2024, 9:38 p.m. UTC
Add basic infrastructure to support code tagging which stores tag common information consisting of the module name, function, file name and line number. Provide functions to register a new code tag type and navigate between code tags. Co-developed-by: Kent Overstreet <kent.overstreet@linux.dev> Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev> Signed-off-by: Suren Baghdasaryan <surenb@google.com> --- include/linux/codetag.h | 71 ++++++++++++++ lib/Kconfig.debug | 4 + lib/Makefile | 1 + lib/codetag.c | 199 ++++++++++++++++++++++++++++++++++++++++ 4 files changed, 275 insertions(+) create mode 100644 include/linux/codetag.h create mode 100644 lib/codetag.c
Comments
On Mon, Feb 12, 2024 at 01:38:56PM -0800, Suren Baghdasaryan wrote: > Add basic infrastructure to support code tagging which stores tag common > information consisting of the module name, function, file name and line > number. Provide functions to register a new code tag type and navigate > between code tags. > > Co-developed-by: Kent Overstreet <kent.overstreet@linux.dev> > Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev> > Signed-off-by: Suren Baghdasaryan <surenb@google.com> > --- > include/linux/codetag.h | 71 ++++++++++++++ > lib/Kconfig.debug | 4 + > lib/Makefile | 1 + > lib/codetag.c | 199 ++++++++++++++++++++++++++++++++++++++++ > 4 files changed, 275 insertions(+) > create mode 100644 include/linux/codetag.h > create mode 100644 lib/codetag.c > > diff --git a/include/linux/codetag.h b/include/linux/codetag.h > new file mode 100644 > index 000000000000..a9d7adecc2a5 > --- /dev/null > +++ b/include/linux/codetag.h > @@ -0,0 +1,71 @@ > +/* SPDX-License-Identifier: GPL-2.0 */ > +/* > + * code tagging framework > + */ > +#ifndef _LINUX_CODETAG_H > +#define _LINUX_CODETAG_H > + > +#include <linux/types.h> > + > +struct codetag_iterator; > +struct codetag_type; > +struct seq_buf; > +struct module; > + > +/* > + * An instance of this structure is created in a special ELF section at every > + * code location being tagged. At runtime, the special section is treated as > + * an array of these. > + */ > +struct codetag { > + unsigned int flags; /* used in later patches */ > + unsigned int lineno; > + const char *modname; > + const char *function; > + const char *filename; > +} __aligned(8); > + > +union codetag_ref { > + struct codetag *ct; > +}; > + > +struct codetag_range { > + struct codetag *start; > + struct codetag *stop; > +}; > + > +struct codetag_module { > + struct module *mod; > + struct codetag_range range; > +}; > + > +struct codetag_type_desc { > + const char *section; > + size_t tag_size; > +}; > + > +struct codetag_iterator { > + struct codetag_type *cttype; > + struct codetag_module *cmod; > + unsigned long mod_id; > + struct codetag *ct; > +}; > + > +#define CODE_TAG_INIT { \ > + .modname = KBUILD_MODNAME, \ > + .function = __func__, \ > + .filename = __FILE__, \ > + .lineno = __LINE__, \ > + .flags = 0, \ > +} > + > +void codetag_lock_module_list(struct codetag_type *cttype, bool lock); > +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype); > +struct codetag *codetag_next_ct(struct codetag_iterator *iter); > + > +void codetag_to_text(struct seq_buf *out, struct codetag *ct); > + > +struct codetag_type * > +codetag_register_type(const struct codetag_type_desc *desc); > + > +#endif /* _LINUX_CODETAG_H */ > diff --git a/lib/Kconfig.debug b/lib/Kconfig.debug > index 975a07f9f1cc..0be2d00c3696 100644 > --- a/lib/Kconfig.debug > +++ b/lib/Kconfig.debug > @@ -968,6 +968,10 @@ config DEBUG_STACKOVERFLOW > > If in doubt, say "N". > > +config CODE_TAGGING > + bool > + select KALLSYMS > + > source "lib/Kconfig.kasan" > source "lib/Kconfig.kfence" > source "lib/Kconfig.kmsan" > diff --git a/lib/Makefile b/lib/Makefile > index 6b09731d8e61..6b48b22fdfac 100644 > --- a/lib/Makefile > +++ b/lib/Makefile > @@ -235,6 +235,7 @@ obj-$(CONFIG_OF_RECONFIG_NOTIFIER_ERROR_INJECT) += \ > of-reconfig-notifier-error-inject.o > obj-$(CONFIG_FUNCTION_ERROR_INJECTION) += error-inject.o > > +obj-$(CONFIG_CODE_TAGGING) += codetag.o > lib-$(CONFIG_GENERIC_BUG) += bug.o > > obj-$(CONFIG_HAVE_ARCH_TRACEHOOK) += syscall.o > diff --git a/lib/codetag.c b/lib/codetag.c > new file mode 100644 > index 000000000000..7708f8388e55 > --- /dev/null > +++ b/lib/codetag.c > @@ -0,0 +1,199 @@ > +// SPDX-License-Identifier: GPL-2.0-only > +#include <linux/codetag.h> > +#include <linux/idr.h> > +#include <linux/kallsyms.h> > +#include <linux/module.h> > +#include <linux/seq_buf.h> > +#include <linux/slab.h> > + > +struct codetag_type { > + struct list_head link; > + unsigned int count; > + struct idr mod_idr; > + struct rw_semaphore mod_lock; /* protects mod_idr */ > + struct codetag_type_desc desc; > +}; > + > +static DEFINE_MUTEX(codetag_lock); > +static LIST_HEAD(codetag_types); > + > +void codetag_lock_module_list(struct codetag_type *cttype, bool lock) > +{ > + if (lock) > + down_read(&cttype->mod_lock); > + else > + up_read(&cttype->mod_lock); > +} > + > +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype) > +{ > + struct codetag_iterator iter = { > + .cttype = cttype, > + .cmod = NULL, > + .mod_id = 0, > + .ct = NULL, > + }; > + > + return iter; > +} > + > +static inline struct codetag *get_first_module_ct(struct codetag_module *cmod) > +{ > + return cmod->range.start < cmod->range.stop ? cmod->range.start : NULL; > +} > + > +static inline > +struct codetag *get_next_module_ct(struct codetag_iterator *iter) > +{ > + struct codetag *res = (struct codetag *) > + ((char *)iter->ct + iter->cttype->desc.tag_size); > + > + return res < iter->cmod->range.stop ? res : NULL; > +} > + > +struct codetag *codetag_next_ct(struct codetag_iterator *iter) > +{ > + struct codetag_type *cttype = iter->cttype; > + struct codetag_module *cmod; > + struct codetag *ct; > + > + lockdep_assert_held(&cttype->mod_lock); > + > + if (unlikely(idr_is_empty(&cttype->mod_idr))) > + return NULL; > + > + ct = NULL; > + while (true) { > + cmod = idr_find(&cttype->mod_idr, iter->mod_id); > + > + /* If module was removed move to the next one */ > + if (!cmod) > + cmod = idr_get_next_ul(&cttype->mod_idr, > + &iter->mod_id); > + > + /* Exit if no more modules */ > + if (!cmod) > + break; > + > + if (cmod != iter->cmod) { > + iter->cmod = cmod; > + ct = get_first_module_ct(cmod); > + } else > + ct = get_next_module_ct(iter); > + > + if (ct) > + break; > + > + iter->mod_id++; > + } > + > + iter->ct = ct; > + return ct; > +} > + > +void codetag_to_text(struct seq_buf *out, struct codetag *ct) > +{ > + seq_buf_printf(out, "%s:%u module:%s func:%s", > + ct->filename, ct->lineno, > + ct->modname, ct->function); > +} Thank you for using seq_buf here! Also, will this need an EXPORT_SYMBOL_GPL()? > + > +static inline size_t range_size(const struct codetag_type *cttype, > + const struct codetag_range *range) > +{ > + return ((char *)range->stop - (char *)range->start) / > + cttype->desc.tag_size; > +} > + > +static void *get_symbol(struct module *mod, const char *prefix, const char *name) > +{ > + char buf[64]; Why is 64 enough? I was expecting KSYM_NAME_LEN here, but perhaps this is specialized enough to section names that it will not be a problem? If so, please document it clearly with a comment. > + int res; > + > + res = snprintf(buf, sizeof(buf), "%s%s", prefix, name); > + if (WARN_ON(res < 1 || res > sizeof(buf))) > + return NULL; Please use a seq_buf here instead of snprintf, which we're trying to get rid of. DECLARE_SEQ_BUF(sb, KSYM_NAME_LEN); char *buf; seq_buf_printf(sb, "%s%s", prefix, name); if (seq_buf_has_overflowed(sb)) return NULL; buf = seq_buf_str(sb); > + > + return mod ? > + (void *)find_kallsyms_symbol_value(mod, buf) : > + (void *)kallsyms_lookup_name(buf); > +} > + > +static struct codetag_range get_section_range(struct module *mod, > + const char *section) > +{ > + return (struct codetag_range) { > + get_symbol(mod, "__start_", section), > + get_symbol(mod, "__stop_", section), > + }; > +} > + > +static int codetag_module_init(struct codetag_type *cttype, struct module *mod) > +{ > + struct codetag_range range; > + struct codetag_module *cmod; > + int err; > + > + range = get_section_range(mod, cttype->desc.section); > + if (!range.start || !range.stop) { > + pr_warn("Failed to load code tags of type %s from the module %s\n", > + cttype->desc.section, > + mod ? mod->name : "(built-in)"); > + return -EINVAL; > + } > + > + /* Ignore empty ranges */ > + if (range.start == range.stop) > + return 0; > + > + BUG_ON(range.start > range.stop); > + > + cmod = kmalloc(sizeof(*cmod), GFP_KERNEL); > + if (unlikely(!cmod)) > + return -ENOMEM; > + > + cmod->mod = mod; > + cmod->range = range; > + > + down_write(&cttype->mod_lock); > + err = idr_alloc(&cttype->mod_idr, cmod, 0, 0, GFP_KERNEL); > + if (err >= 0) > + cttype->count += range_size(cttype, &range); > + up_write(&cttype->mod_lock); > + > + if (err < 0) { > + kfree(cmod); > + return err; > + } > + > + return 0; > +} > + > +struct codetag_type * > +codetag_register_type(const struct codetag_type_desc *desc) > +{ > + struct codetag_type *cttype; > + int err; > + > + BUG_ON(desc->tag_size <= 0); > + > + cttype = kzalloc(sizeof(*cttype), GFP_KERNEL); > + if (unlikely(!cttype)) > + return ERR_PTR(-ENOMEM); > + > + cttype->desc = *desc; > + idr_init(&cttype->mod_idr); > + init_rwsem(&cttype->mod_lock); > + > + err = codetag_module_init(cttype, NULL); > + if (unlikely(err)) { > + kfree(cttype); > + return ERR_PTR(err); > + } > + > + mutex_lock(&codetag_lock); > + list_add_tail(&cttype->link, &codetag_types); > + mutex_unlock(&codetag_lock); > + > + return cttype; > +} > -- > 2.43.0.687.g38aa6559b0-goog >
On Mon, Feb 12, 2024 at 2:27 PM Kees Cook <keescook@chromium.org> wrote: > > On Mon, Feb 12, 2024 at 01:38:56PM -0800, Suren Baghdasaryan wrote: > > Add basic infrastructure to support code tagging which stores tag common > > information consisting of the module name, function, file name and line > > number. Provide functions to register a new code tag type and navigate > > between code tags. > > > > Co-developed-by: Kent Overstreet <kent.overstreet@linux.dev> > > Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev> > > Signed-off-by: Suren Baghdasaryan <surenb@google.com> > > --- > > include/linux/codetag.h | 71 ++++++++++++++ > > lib/Kconfig.debug | 4 + > > lib/Makefile | 1 + > > lib/codetag.c | 199 ++++++++++++++++++++++++++++++++++++++++ > > 4 files changed, 275 insertions(+) > > create mode 100644 include/linux/codetag.h > > create mode 100644 lib/codetag.c > > > > diff --git a/include/linux/codetag.h b/include/linux/codetag.h > > new file mode 100644 > > index 000000000000..a9d7adecc2a5 > > --- /dev/null > > +++ b/include/linux/codetag.h > > @@ -0,0 +1,71 @@ > > +/* SPDX-License-Identifier: GPL-2.0 */ > > +/* > > + * code tagging framework > > + */ > > +#ifndef _LINUX_CODETAG_H > > +#define _LINUX_CODETAG_H > > + > > +#include <linux/types.h> > > + > > +struct codetag_iterator; > > +struct codetag_type; > > +struct seq_buf; > > +struct module; > > + > > +/* > > + * An instance of this structure is created in a special ELF section at every > > + * code location being tagged. At runtime, the special section is treated as > > + * an array of these. > > + */ > > +struct codetag { > > + unsigned int flags; /* used in later patches */ > > + unsigned int lineno; > > + const char *modname; > > + const char *function; > > + const char *filename; > > +} __aligned(8); > > + > > +union codetag_ref { > > + struct codetag *ct; > > +}; > > + > > +struct codetag_range { > > + struct codetag *start; > > + struct codetag *stop; > > +}; > > + > > +struct codetag_module { > > + struct module *mod; > > + struct codetag_range range; > > +}; > > + > > +struct codetag_type_desc { > > + const char *section; > > + size_t tag_size; > > +}; > > + > > +struct codetag_iterator { > > + struct codetag_type *cttype; > > + struct codetag_module *cmod; > > + unsigned long mod_id; > > + struct codetag *ct; > > +}; > > + > > +#define CODE_TAG_INIT { \ > > + .modname = KBUILD_MODNAME, \ > > + .function = __func__, \ > > + .filename = __FILE__, \ > > + .lineno = __LINE__, \ > > + .flags = 0, \ > > +} > > + > > +void codetag_lock_module_list(struct codetag_type *cttype, bool lock); > > +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype); > > +struct codetag *codetag_next_ct(struct codetag_iterator *iter); > > + > > +void codetag_to_text(struct seq_buf *out, struct codetag *ct); > > + > > +struct codetag_type * > > +codetag_register_type(const struct codetag_type_desc *desc); > > + > > +#endif /* _LINUX_CODETAG_H */ > > diff --git a/lib/Kconfig.debug b/lib/Kconfig.debug > > index 975a07f9f1cc..0be2d00c3696 100644 > > --- a/lib/Kconfig.debug > > +++ b/lib/Kconfig.debug > > @@ -968,6 +968,10 @@ config DEBUG_STACKOVERFLOW > > > > If in doubt, say "N". > > > > +config CODE_TAGGING > > + bool > > + select KALLSYMS > > + > > source "lib/Kconfig.kasan" > > source "lib/Kconfig.kfence" > > source "lib/Kconfig.kmsan" > > diff --git a/lib/Makefile b/lib/Makefile > > index 6b09731d8e61..6b48b22fdfac 100644 > > --- a/lib/Makefile > > +++ b/lib/Makefile > > @@ -235,6 +235,7 @@ obj-$(CONFIG_OF_RECONFIG_NOTIFIER_ERROR_INJECT) += \ > > of-reconfig-notifier-error-inject.o > > obj-$(CONFIG_FUNCTION_ERROR_INJECTION) += error-inject.o > > > > +obj-$(CONFIG_CODE_TAGGING) += codetag.o > > lib-$(CONFIG_GENERIC_BUG) += bug.o > > > > obj-$(CONFIG_HAVE_ARCH_TRACEHOOK) += syscall.o > > diff --git a/lib/codetag.c b/lib/codetag.c > > new file mode 100644 > > index 000000000000..7708f8388e55 > > --- /dev/null > > +++ b/lib/codetag.c > > @@ -0,0 +1,199 @@ > > +// SPDX-License-Identifier: GPL-2.0-only > > +#include <linux/codetag.h> > > +#include <linux/idr.h> > > +#include <linux/kallsyms.h> > > +#include <linux/module.h> > > +#include <linux/seq_buf.h> > > +#include <linux/slab.h> > > + > > +struct codetag_type { > > + struct list_head link; > > + unsigned int count; > > + struct idr mod_idr; > > + struct rw_semaphore mod_lock; /* protects mod_idr */ > > + struct codetag_type_desc desc; > > +}; > > + > > +static DEFINE_MUTEX(codetag_lock); > > +static LIST_HEAD(codetag_types); > > + > > +void codetag_lock_module_list(struct codetag_type *cttype, bool lock) > > +{ > > + if (lock) > > + down_read(&cttype->mod_lock); > > + else > > + up_read(&cttype->mod_lock); > > +} > > + > > +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype) > > +{ > > + struct codetag_iterator iter = { > > + .cttype = cttype, > > + .cmod = NULL, > > + .mod_id = 0, > > + .ct = NULL, > > + }; > > + > > + return iter; > > +} > > + > > +static inline struct codetag *get_first_module_ct(struct codetag_module *cmod) > > +{ > > + return cmod->range.start < cmod->range.stop ? cmod->range.start : NULL; > > +} > > + > > +static inline > > +struct codetag *get_next_module_ct(struct codetag_iterator *iter) > > +{ > > + struct codetag *res = (struct codetag *) > > + ((char *)iter->ct + iter->cttype->desc.tag_size); > > + > > + return res < iter->cmod->range.stop ? res : NULL; > > +} > > + > > +struct codetag *codetag_next_ct(struct codetag_iterator *iter) > > +{ > > + struct codetag_type *cttype = iter->cttype; > > + struct codetag_module *cmod; > > + struct codetag *ct; > > + > > + lockdep_assert_held(&cttype->mod_lock); > > + > > + if (unlikely(idr_is_empty(&cttype->mod_idr))) > > + return NULL; > > + > > + ct = NULL; > > + while (true) { > > + cmod = idr_find(&cttype->mod_idr, iter->mod_id); > > + > > + /* If module was removed move to the next one */ > > + if (!cmod) > > + cmod = idr_get_next_ul(&cttype->mod_idr, > > + &iter->mod_id); > > + > > + /* Exit if no more modules */ > > + if (!cmod) > > + break; > > + > > + if (cmod != iter->cmod) { > > + iter->cmod = cmod; > > + ct = get_first_module_ct(cmod); > > + } else > > + ct = get_next_module_ct(iter); > > + > > + if (ct) > > + break; > > + > > + iter->mod_id++; > > + } > > + > > + iter->ct = ct; > > + return ct; > > +} > > + > > +void codetag_to_text(struct seq_buf *out, struct codetag *ct) > > +{ > > + seq_buf_printf(out, "%s:%u module:%s func:%s", > > + ct->filename, ct->lineno, > > + ct->modname, ct->function); > > +} > > Thank you for using seq_buf here! > > Also, will this need an EXPORT_SYMBOL_GPL()? > > > + > > +static inline size_t range_size(const struct codetag_type *cttype, > > + const struct codetag_range *range) > > +{ > > + return ((char *)range->stop - (char *)range->start) / > > + cttype->desc.tag_size; > > +} > > + > > +static void *get_symbol(struct module *mod, const char *prefix, const char *name) > > +{ > > + char buf[64]; > > Why is 64 enough? I was expecting KSYM_NAME_LEN here, but perhaps this > is specialized enough to section names that it will not be a problem? This buffer is being used to hold the name of the section containing codetags appended with "__start_" or "__stop_" and the only current user is alloc_tag_init() which sets the section name to "alloc_tags". So, this buffer currently holds either "alloc_tags__start_" or "alloc_tags__stop_". When more codetag applications are added (like the ones we have shown in the original RFC [1]), there would be more section names. 64 was chosen as a big enough value to reasonably hold the section name with the suffix. But you are right, we should add a check for the section name size to ensure it always fits. Will add into my TODO list. [1] https://lore.kernel.org/all/20220830214919.53220-1-surenb@google.com/ > If so, please document it clearly with a comment. Will do. > > > + int res; > > + > > + res = snprintf(buf, sizeof(buf), "%s%s", prefix, name); > > + if (WARN_ON(res < 1 || res > sizeof(buf))) > > + return NULL; > > Please use a seq_buf here instead of snprintf, which we're trying to get > rid of. > > DECLARE_SEQ_BUF(sb, KSYM_NAME_LEN); > char *buf; > > seq_buf_printf(sb, "%s%s", prefix, name); > if (seq_buf_has_overflowed(sb)) > return NULL; > > buf = seq_buf_str(sb); Will do. Thanks! > > > + > > + return mod ? > > + (void *)find_kallsyms_symbol_value(mod, buf) : > > + (void *)kallsyms_lookup_name(buf); > > +} > > + > > +static struct codetag_range get_section_range(struct module *mod, > > + const char *section) > > +{ > > + return (struct codetag_range) { > > + get_symbol(mod, "__start_", section), > > + get_symbol(mod, "__stop_", section), > > + }; > > +} > > + > > +static int codetag_module_init(struct codetag_type *cttype, struct module *mod) > > +{ > > + struct codetag_range range; > > + struct codetag_module *cmod; > > + int err; > > + > > + range = get_section_range(mod, cttype->desc.section); > > + if (!range.start || !range.stop) { > > + pr_warn("Failed to load code tags of type %s from the module %s\n", > > + cttype->desc.section, > > + mod ? mod->name : "(built-in)"); > > + return -EINVAL; > > + } > > + > > + /* Ignore empty ranges */ > > + if (range.start == range.stop) > > + return 0; > > + > > + BUG_ON(range.start > range.stop); > > + > > + cmod = kmalloc(sizeof(*cmod), GFP_KERNEL); > > + if (unlikely(!cmod)) > > + return -ENOMEM; > > + > > + cmod->mod = mod; > > + cmod->range = range; > > + > > + down_write(&cttype->mod_lock); > > + err = idr_alloc(&cttype->mod_idr, cmod, 0, 0, GFP_KERNEL); > > + if (err >= 0) > > + cttype->count += range_size(cttype, &range); > > + up_write(&cttype->mod_lock); > > + > > + if (err < 0) { > > + kfree(cmod); > > + return err; > > + } > > + > > + return 0; > > +} > > + > > +struct codetag_type * > > +codetag_register_type(const struct codetag_type_desc *desc) > > +{ > > + struct codetag_type *cttype; > > + int err; > > + > > + BUG_ON(desc->tag_size <= 0); > > + > > + cttype = kzalloc(sizeof(*cttype), GFP_KERNEL); > > + if (unlikely(!cttype)) > > + return ERR_PTR(-ENOMEM); > > + > > + cttype->desc = *desc; > > + idr_init(&cttype->mod_idr); > > + init_rwsem(&cttype->mod_lock); > > + > > + err = codetag_module_init(cttype, NULL); > > + if (unlikely(err)) { > > + kfree(cttype); > > + return ERR_PTR(err); > > + } > > + > > + mutex_lock(&codetag_lock); > > + list_add_tail(&cttype->link, &codetag_types); > > + mutex_unlock(&codetag_lock); > > + > > + return cttype; > > +} > > -- > > 2.43.0.687.g38aa6559b0-goog > > > > -- > Kees Cook
On Mon, Feb 12, 2024 at 6:04 PM Suren Baghdasaryan <surenb@google.com> wrote: > > On Mon, Feb 12, 2024 at 2:27 PM Kees Cook <keescook@chromium.org> wrote: > > > > On Mon, Feb 12, 2024 at 01:38:56PM -0800, Suren Baghdasaryan wrote: > > > Add basic infrastructure to support code tagging which stores tag common > > > information consisting of the module name, function, file name and line > > > number. Provide functions to register a new code tag type and navigate > > > between code tags. > > > > > > Co-developed-by: Kent Overstreet <kent.overstreet@linux.dev> > > > Signed-off-by: Kent Overstreet <kent.overstreet@linux.dev> > > > Signed-off-by: Suren Baghdasaryan <surenb@google.com> > > > --- > > > include/linux/codetag.h | 71 ++++++++++++++ > > > lib/Kconfig.debug | 4 + > > > lib/Makefile | 1 + > > > lib/codetag.c | 199 ++++++++++++++++++++++++++++++++++++++++ > > > 4 files changed, 275 insertions(+) > > > create mode 100644 include/linux/codetag.h > > > create mode 100644 lib/codetag.c > > > > > > diff --git a/include/linux/codetag.h b/include/linux/codetag.h > > > new file mode 100644 > > > index 000000000000..a9d7adecc2a5 > > > --- /dev/null > > > +++ b/include/linux/codetag.h > > > @@ -0,0 +1,71 @@ > > > +/* SPDX-License-Identifier: GPL-2.0 */ > > > +/* > > > + * code tagging framework > > > + */ > > > +#ifndef _LINUX_CODETAG_H > > > +#define _LINUX_CODETAG_H > > > + > > > +#include <linux/types.h> > > > + > > > +struct codetag_iterator; > > > +struct codetag_type; > > > +struct seq_buf; > > > +struct module; > > > + > > > +/* > > > + * An instance of this structure is created in a special ELF section at every > > > + * code location being tagged. At runtime, the special section is treated as > > > + * an array of these. > > > + */ > > > +struct codetag { > > > + unsigned int flags; /* used in later patches */ > > > + unsigned int lineno; > > > + const char *modname; > > > + const char *function; > > > + const char *filename; > > > +} __aligned(8); > > > + > > > +union codetag_ref { > > > + struct codetag *ct; > > > +}; > > > + > > > +struct codetag_range { > > > + struct codetag *start; > > > + struct codetag *stop; > > > +}; > > > + > > > +struct codetag_module { > > > + struct module *mod; > > > + struct codetag_range range; > > > +}; > > > + > > > +struct codetag_type_desc { > > > + const char *section; > > > + size_t tag_size; > > > +}; > > > + > > > +struct codetag_iterator { > > > + struct codetag_type *cttype; > > > + struct codetag_module *cmod; > > > + unsigned long mod_id; > > > + struct codetag *ct; > > > +}; > > > + > > > +#define CODE_TAG_INIT { \ > > > + .modname = KBUILD_MODNAME, \ > > > + .function = __func__, \ > > > + .filename = __FILE__, \ > > > + .lineno = __LINE__, \ > > > + .flags = 0, \ > > > +} > > > + > > > +void codetag_lock_module_list(struct codetag_type *cttype, bool lock); > > > +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype); > > > +struct codetag *codetag_next_ct(struct codetag_iterator *iter); > > > + > > > +void codetag_to_text(struct seq_buf *out, struct codetag *ct); > > > + > > > +struct codetag_type * > > > +codetag_register_type(const struct codetag_type_desc *desc); > > > + > > > +#endif /* _LINUX_CODETAG_H */ > > > diff --git a/lib/Kconfig.debug b/lib/Kconfig.debug > > > index 975a07f9f1cc..0be2d00c3696 100644 > > > --- a/lib/Kconfig.debug > > > +++ b/lib/Kconfig.debug > > > @@ -968,6 +968,10 @@ config DEBUG_STACKOVERFLOW > > > > > > If in doubt, say "N". > > > > > > +config CODE_TAGGING > > > + bool > > > + select KALLSYMS > > > + > > > source "lib/Kconfig.kasan" > > > source "lib/Kconfig.kfence" > > > source "lib/Kconfig.kmsan" > > > diff --git a/lib/Makefile b/lib/Makefile > > > index 6b09731d8e61..6b48b22fdfac 100644 > > > --- a/lib/Makefile > > > +++ b/lib/Makefile > > > @@ -235,6 +235,7 @@ obj-$(CONFIG_OF_RECONFIG_NOTIFIER_ERROR_INJECT) += \ > > > of-reconfig-notifier-error-inject.o > > > obj-$(CONFIG_FUNCTION_ERROR_INJECTION) += error-inject.o > > > > > > +obj-$(CONFIG_CODE_TAGGING) += codetag.o > > > lib-$(CONFIG_GENERIC_BUG) += bug.o > > > > > > obj-$(CONFIG_HAVE_ARCH_TRACEHOOK) += syscall.o > > > diff --git a/lib/codetag.c b/lib/codetag.c > > > new file mode 100644 > > > index 000000000000..7708f8388e55 > > > --- /dev/null > > > +++ b/lib/codetag.c > > > @@ -0,0 +1,199 @@ > > > +// SPDX-License-Identifier: GPL-2.0-only > > > +#include <linux/codetag.h> > > > +#include <linux/idr.h> > > > +#include <linux/kallsyms.h> > > > +#include <linux/module.h> > > > +#include <linux/seq_buf.h> > > > +#include <linux/slab.h> > > > + > > > +struct codetag_type { > > > + struct list_head link; > > > + unsigned int count; > > > + struct idr mod_idr; > > > + struct rw_semaphore mod_lock; /* protects mod_idr */ > > > + struct codetag_type_desc desc; > > > +}; > > > + > > > +static DEFINE_MUTEX(codetag_lock); > > > +static LIST_HEAD(codetag_types); > > > + > > > +void codetag_lock_module_list(struct codetag_type *cttype, bool lock) > > > +{ > > > + if (lock) > > > + down_read(&cttype->mod_lock); > > > + else > > > + up_read(&cttype->mod_lock); > > > +} > > > + > > > +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype) > > > +{ > > > + struct codetag_iterator iter = { > > > + .cttype = cttype, > > > + .cmod = NULL, > > > + .mod_id = 0, > > > + .ct = NULL, > > > + }; > > > + > > > + return iter; > > > +} > > > + > > > +static inline struct codetag *get_first_module_ct(struct codetag_module *cmod) > > > +{ > > > + return cmod->range.start < cmod->range.stop ? cmod->range.start : NULL; > > > +} > > > + > > > +static inline > > > +struct codetag *get_next_module_ct(struct codetag_iterator *iter) > > > +{ > > > + struct codetag *res = (struct codetag *) > > > + ((char *)iter->ct + iter->cttype->desc.tag_size); > > > + > > > + return res < iter->cmod->range.stop ? res : NULL; > > > +} > > > + > > > +struct codetag *codetag_next_ct(struct codetag_iterator *iter) > > > +{ > > > + struct codetag_type *cttype = iter->cttype; > > > + struct codetag_module *cmod; > > > + struct codetag *ct; > > > + > > > + lockdep_assert_held(&cttype->mod_lock); > > > + > > > + if (unlikely(idr_is_empty(&cttype->mod_idr))) > > > + return NULL; > > > + > > > + ct = NULL; > > > + while (true) { > > > + cmod = idr_find(&cttype->mod_idr, iter->mod_id); > > > + > > > + /* If module was removed move to the next one */ > > > + if (!cmod) > > > + cmod = idr_get_next_ul(&cttype->mod_idr, > > > + &iter->mod_id); > > > + > > > + /* Exit if no more modules */ > > > + if (!cmod) > > > + break; > > > + > > > + if (cmod != iter->cmod) { > > > + iter->cmod = cmod; > > > + ct = get_first_module_ct(cmod); > > > + } else > > > + ct = get_next_module_ct(iter); > > > + > > > + if (ct) > > > + break; > > > + > > > + iter->mod_id++; > > > + } > > > + > > > + iter->ct = ct; > > > + return ct; > > > +} > > > + > > > +void codetag_to_text(struct seq_buf *out, struct codetag *ct) > > > +{ > > > + seq_buf_printf(out, "%s:%u module:%s func:%s", > > > + ct->filename, ct->lineno, > > > + ct->modname, ct->function); > > > +} > > > > Thank you for using seq_buf here! > > > > Also, will this need an EXPORT_SYMBOL_GPL()? Missed this question. I don't think we need EXPORT_SYMBOL_GPL() here at least for now. Modules don't use these functions. The "alloc_tags" sections will be generated for each module at compile time but they themselves do not use it. > > > > > + > > > +static inline size_t range_size(const struct codetag_type *cttype, > > > + const struct codetag_range *range) > > > +{ > > > + return ((char *)range->stop - (char *)range->start) / > > > + cttype->desc.tag_size; > > > +} > > > + > > > +static void *get_symbol(struct module *mod, const char *prefix, const char *name) > > > +{ > > > + char buf[64]; > > > > Why is 64 enough? I was expecting KSYM_NAME_LEN here, but perhaps this > > is specialized enough to section names that it will not be a problem? > > This buffer is being used to hold the name of the section containing > codetags appended with "__start_" or "__stop_" and the only current > user is alloc_tag_init() which sets the section name to "alloc_tags". > So, this buffer currently holds either "alloc_tags__start_" or > "alloc_tags__stop_". When more codetag applications are added (like > the ones we have shown in the original RFC [1]), there would be more > section names. 64 was chosen as a big enough value to reasonably hold > the section name with the suffix. But you are right, we should add a > check for the section name size to ensure it always fits. Will add > into my TODO list. > > [1] https://lore.kernel.org/all/20220830214919.53220-1-surenb@google.com/ > > If so, please document it clearly with a comment. > > Will do. > > > > > > + int res; > > > + > > > + res = snprintf(buf, sizeof(buf), "%s%s", prefix, name); > > > + if (WARN_ON(res < 1 || res > sizeof(buf))) > > > + return NULL; > > > > Please use a seq_buf here instead of snprintf, which we're trying to get > > rid of. > > > > DECLARE_SEQ_BUF(sb, KSYM_NAME_LEN); > > char *buf; > > > > seq_buf_printf(sb, "%s%s", prefix, name); > > if (seq_buf_has_overflowed(sb)) > > return NULL; > > > > buf = seq_buf_str(sb); > > Will do. Thanks! > > > > > > + > > > + return mod ? > > > + (void *)find_kallsyms_symbol_value(mod, buf) : > > > + (void *)kallsyms_lookup_name(buf); > > > +} > > > + > > > +static struct codetag_range get_section_range(struct module *mod, > > > + const char *section) > > > +{ > > > + return (struct codetag_range) { > > > + get_symbol(mod, "__start_", section), > > > + get_symbol(mod, "__stop_", section), > > > + }; > > > +} > > > + > > > +static int codetag_module_init(struct codetag_type *cttype, struct module *mod) > > > +{ > > > + struct codetag_range range; > > > + struct codetag_module *cmod; > > > + int err; > > > + > > > + range = get_section_range(mod, cttype->desc.section); > > > + if (!range.start || !range.stop) { > > > + pr_warn("Failed to load code tags of type %s from the module %s\n", > > > + cttype->desc.section, > > > + mod ? mod->name : "(built-in)"); > > > + return -EINVAL; > > > + } > > > + > > > + /* Ignore empty ranges */ > > > + if (range.start == range.stop) > > > + return 0; > > > + > > > + BUG_ON(range.start > range.stop); > > > + > > > + cmod = kmalloc(sizeof(*cmod), GFP_KERNEL); > > > + if (unlikely(!cmod)) > > > + return -ENOMEM; > > > + > > > + cmod->mod = mod; > > > + cmod->range = range; > > > + > > > + down_write(&cttype->mod_lock); > > > + err = idr_alloc(&cttype->mod_idr, cmod, 0, 0, GFP_KERNEL); > > > + if (err >= 0) > > > + cttype->count += range_size(cttype, &range); > > > + up_write(&cttype->mod_lock); > > > + > > > + if (err < 0) { > > > + kfree(cmod); > > > + return err; > > > + } > > > + > > > + return 0; > > > +} > > > + > > > +struct codetag_type * > > > +codetag_register_type(const struct codetag_type_desc *desc) > > > +{ > > > + struct codetag_type *cttype; > > > + int err; > > > + > > > + BUG_ON(desc->tag_size <= 0); > > > + > > > + cttype = kzalloc(sizeof(*cttype), GFP_KERNEL); > > > + if (unlikely(!cttype)) > > > + return ERR_PTR(-ENOMEM); > > > + > > > + cttype->desc = *desc; > > > + idr_init(&cttype->mod_idr); > > > + init_rwsem(&cttype->mod_lock); > > > + > > > + err = codetag_module_init(cttype, NULL); > > > + if (unlikely(err)) { > > > + kfree(cttype); > > > + return ERR_PTR(err); > > > + } > > > + > > > + mutex_lock(&codetag_lock); > > > + list_add_tail(&cttype->link, &codetag_types); > > > + mutex_unlock(&codetag_lock); > > > + > > > + return cttype; > > > +} > > > -- > > > 2.43.0.687.g38aa6559b0-goog > > > > > > > -- > > Kees Cook
diff --git a/include/linux/codetag.h b/include/linux/codetag.h new file mode 100644 index 000000000000..a9d7adecc2a5 --- /dev/null +++ b/include/linux/codetag.h @@ -0,0 +1,71 @@ +/* SPDX-License-Identifier: GPL-2.0 */ +/* + * code tagging framework + */ +#ifndef _LINUX_CODETAG_H +#define _LINUX_CODETAG_H + +#include <linux/types.h> + +struct codetag_iterator; +struct codetag_type; +struct seq_buf; +struct module; + +/* + * An instance of this structure is created in a special ELF section at every + * code location being tagged. At runtime, the special section is treated as + * an array of these. + */ +struct codetag { + unsigned int flags; /* used in later patches */ + unsigned int lineno; + const char *modname; + const char *function; + const char *filename; +} __aligned(8); + +union codetag_ref { + struct codetag *ct; +}; + +struct codetag_range { + struct codetag *start; + struct codetag *stop; +}; + +struct codetag_module { + struct module *mod; + struct codetag_range range; +}; + +struct codetag_type_desc { + const char *section; + size_t tag_size; +}; + +struct codetag_iterator { + struct codetag_type *cttype; + struct codetag_module *cmod; + unsigned long mod_id; + struct codetag *ct; +}; + +#define CODE_TAG_INIT { \ + .modname = KBUILD_MODNAME, \ + .function = __func__, \ + .filename = __FILE__, \ + .lineno = __LINE__, \ + .flags = 0, \ +} + +void codetag_lock_module_list(struct codetag_type *cttype, bool lock); +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype); +struct codetag *codetag_next_ct(struct codetag_iterator *iter); + +void codetag_to_text(struct seq_buf *out, struct codetag *ct); + +struct codetag_type * +codetag_register_type(const struct codetag_type_desc *desc); + +#endif /* _LINUX_CODETAG_H */ diff --git a/lib/Kconfig.debug b/lib/Kconfig.debug index 975a07f9f1cc..0be2d00c3696 100644 --- a/lib/Kconfig.debug +++ b/lib/Kconfig.debug @@ -968,6 +968,10 @@ config DEBUG_STACKOVERFLOW If in doubt, say "N". +config CODE_TAGGING + bool + select KALLSYMS + source "lib/Kconfig.kasan" source "lib/Kconfig.kfence" source "lib/Kconfig.kmsan" diff --git a/lib/Makefile b/lib/Makefile index 6b09731d8e61..6b48b22fdfac 100644 --- a/lib/Makefile +++ b/lib/Makefile @@ -235,6 +235,7 @@ obj-$(CONFIG_OF_RECONFIG_NOTIFIER_ERROR_INJECT) += \ of-reconfig-notifier-error-inject.o obj-$(CONFIG_FUNCTION_ERROR_INJECTION) += error-inject.o +obj-$(CONFIG_CODE_TAGGING) += codetag.o lib-$(CONFIG_GENERIC_BUG) += bug.o obj-$(CONFIG_HAVE_ARCH_TRACEHOOK) += syscall.o diff --git a/lib/codetag.c b/lib/codetag.c new file mode 100644 index 000000000000..7708f8388e55 --- /dev/null +++ b/lib/codetag.c @@ -0,0 +1,199 @@ +// SPDX-License-Identifier: GPL-2.0-only +#include <linux/codetag.h> +#include <linux/idr.h> +#include <linux/kallsyms.h> +#include <linux/module.h> +#include <linux/seq_buf.h> +#include <linux/slab.h> + +struct codetag_type { + struct list_head link; + unsigned int count; + struct idr mod_idr; + struct rw_semaphore mod_lock; /* protects mod_idr */ + struct codetag_type_desc desc; +}; + +static DEFINE_MUTEX(codetag_lock); +static LIST_HEAD(codetag_types); + +void codetag_lock_module_list(struct codetag_type *cttype, bool lock) +{ + if (lock) + down_read(&cttype->mod_lock); + else + up_read(&cttype->mod_lock); +} + +struct codetag_iterator codetag_get_ct_iter(struct codetag_type *cttype) +{ + struct codetag_iterator iter = { + .cttype = cttype, + .cmod = NULL, + .mod_id = 0, + .ct = NULL, + }; + + return iter; +} + +static inline struct codetag *get_first_module_ct(struct codetag_module *cmod) +{ + return cmod->range.start < cmod->range.stop ? cmod->range.start : NULL; +} + +static inline +struct codetag *get_next_module_ct(struct codetag_iterator *iter) +{ + struct codetag *res = (struct codetag *) + ((char *)iter->ct + iter->cttype->desc.tag_size); + + return res < iter->cmod->range.stop ? res : NULL; +} + +struct codetag *codetag_next_ct(struct codetag_iterator *iter) +{ + struct codetag_type *cttype = iter->cttype; + struct codetag_module *cmod; + struct codetag *ct; + + lockdep_assert_held(&cttype->mod_lock); + + if (unlikely(idr_is_empty(&cttype->mod_idr))) + return NULL; + + ct = NULL; + while (true) { + cmod = idr_find(&cttype->mod_idr, iter->mod_id); + + /* If module was removed move to the next one */ + if (!cmod) + cmod = idr_get_next_ul(&cttype->mod_idr, + &iter->mod_id); + + /* Exit if no more modules */ + if (!cmod) + break; + + if (cmod != iter->cmod) { + iter->cmod = cmod; + ct = get_first_module_ct(cmod); + } else + ct = get_next_module_ct(iter); + + if (ct) + break; + + iter->mod_id++; + } + + iter->ct = ct; + return ct; +} + +void codetag_to_text(struct seq_buf *out, struct codetag *ct) +{ + seq_buf_printf(out, "%s:%u module:%s func:%s", + ct->filename, ct->lineno, + ct->modname, ct->function); +} + +static inline size_t range_size(const struct codetag_type *cttype, + const struct codetag_range *range) +{ + return ((char *)range->stop - (char *)range->start) / + cttype->desc.tag_size; +} + +static void *get_symbol(struct module *mod, const char *prefix, const char *name) +{ + char buf[64]; + int res; + + res = snprintf(buf, sizeof(buf), "%s%s", prefix, name); + if (WARN_ON(res < 1 || res > sizeof(buf))) + return NULL; + + return mod ? + (void *)find_kallsyms_symbol_value(mod, buf) : + (void *)kallsyms_lookup_name(buf); +} + +static struct codetag_range get_section_range(struct module *mod, + const char *section) +{ + return (struct codetag_range) { + get_symbol(mod, "__start_", section), + get_symbol(mod, "__stop_", section), + }; +} + +static int codetag_module_init(struct codetag_type *cttype, struct module *mod) +{ + struct codetag_range range; + struct codetag_module *cmod; + int err; + + range = get_section_range(mod, cttype->desc.section); + if (!range.start || !range.stop) { + pr_warn("Failed to load code tags of type %s from the module %s\n", + cttype->desc.section, + mod ? mod->name : "(built-in)"); + return -EINVAL; + } + + /* Ignore empty ranges */ + if (range.start == range.stop) + return 0; + + BUG_ON(range.start > range.stop); + + cmod = kmalloc(sizeof(*cmod), GFP_KERNEL); + if (unlikely(!cmod)) + return -ENOMEM; + + cmod->mod = mod; + cmod->range = range; + + down_write(&cttype->mod_lock); + err = idr_alloc(&cttype->mod_idr, cmod, 0, 0, GFP_KERNEL); + if (err >= 0) + cttype->count += range_size(cttype, &range); + up_write(&cttype->mod_lock); + + if (err < 0) { + kfree(cmod); + return err; + } + + return 0; +} + +struct codetag_type * +codetag_register_type(const struct codetag_type_desc *desc) +{ + struct codetag_type *cttype; + int err; + + BUG_ON(desc->tag_size <= 0); + + cttype = kzalloc(sizeof(*cttype), GFP_KERNEL); + if (unlikely(!cttype)) + return ERR_PTR(-ENOMEM); + + cttype->desc = *desc; + idr_init(&cttype->mod_idr); + init_rwsem(&cttype->mod_lock); + + err = codetag_module_init(cttype, NULL); + if (unlikely(err)) { + kfree(cttype); + return ERR_PTR(err); + } + + mutex_lock(&codetag_lock); + list_add_tail(&cttype->link, &codetag_types); + mutex_unlock(&codetag_lock); + + return cttype; +}