middle-end, c++, i386, libgcc: std::bfloat16_t and __bf16 arithmetic support

  On Fri, Sep 30, 2022 at 04:08:10PM +0200, Jakub Jelinek via Gcc-patches wrote:
> On Fri, Sep 30, 2022 at 09:49:08AM -0400, Jason Merrill wrote:
> > The comment from Apple on the ABI mangling proposal suggests to me that we
> > might want to delay enabling C++ std::bfloat16_t (i.e. defining
> > __STDCPP_BFLOAT16_T__) until we have that excess precision support?
> 
> I saw that comment.  We have similar problem with _Float16 too, where C++
> effectively right now works as when one uses -fexcess-precision=16 in C
> (which isn't default).
> I can see how hard would it be to add EXCESS_PRECISION_EXPR support to C++
> FE.

I've started on that but it will take some time.  That said, it should
work though less efficiently even without that, even in C users can always
select request such behavior with -fexcess-precision=16.

> > If we're using DF32x for _Float32x, maybe we want DF16b for bfloat16?
> 
> Perhaps, I just followed what was in the pull request.  Can change it.

Changed now, added support for the builtins and ported most of the
float16 tests, so that it gets at least some test coverage.
Also, for now I've left the aarch64 and arm changes out of the patch,
because I haven't tested it on aarch64 yet and arm support was incomplete
and I haven't heard from the ARM maintainers yet what they want or don't
want.

The added testcases showed a few problems.  One is that i?86 maintains
2 kinds of fp comparisons, trivial and non-trivial, the trivial which can
be handled by just a single conditional jump or setCC are handled directly,
while the complex ones which need two are not handled and the generic
code then figures it out using the trivial ones.  Unfortunately this means
that for == and != we end up with libcalls for it.  For _Float16, we have
added __nehf2 and __eqhf2 entrypoints last year.  I wanted to avoid doing
the same for __bf16, so I've added cbranchbf4 and cstorebf4 expanders
that handle all fp comparisons and internally just shift the operands up
to construct SFmode without even handling sNaNs and then call the generic
code to handle SFmode comparisons.

Another problem is for HFmode comparisons, when we see we don't support
directly some HFmode comparison, we iterate on wider scalar float modes
and look for usable comparisons, but BFmode and HFmode are unordered and
one of them has to appear as wider but neither is a subset nor superset,
so I had to skip wider modes which have equal precision to the starting one.
Yet another problem is because I've only enabled the bf16/BF16 suffixes in
C++ because for C it might clash with some later extension.  Am I right to
fear about that, or do you think C will never standardize suffixes that
would clash with that because C++ standardized the bf16/BF16 suffixes for
something already?  If I could enable it, I'd always pedwarn for C for those
and could enable the __BF16_*__ macros.  Right now I had to disable some
-fbuilding-libgcc macros because of that (though nothing really uses them
right now).

Another question is the suffixes of the builtins.  For now I have added
bf16 suffix and enabled the builtins with !both_p, so one always needs to
use __builtin_* form for them.  None of the GCC builtins end with b,
so this isn't ambiguous with __builtin_*f16, but some libm functions do end
with b, in particular ilogb, logb and f{??,??x}sub.  ilogb and the subs
always have it, but is __builtin_logbf16 f16 suffixed logb or bf16 suffixed
log?  Shall the builtins use f16b suffixes instead like the mangling does?

Full patch bootstrapped/regtested on x86_64-linux and i686-linux.

2022-10-04  Jakub Jelinek  <jakub@redhat.com>

gcc/
	* tree-core.h (enum tree_index): Add TI_BFLOAT16_TYPE.
	* tree.h (bfloat16_type_node): Define.
	(CASE_FLT_FN_FLOATN_NX): Also include BUILT_IN_*BF16.
	* tree.cc (excess_precision_type): Promote bfloat16_type_mode
	like float16_type_mode.
	(build_common_tree_nodes): Initialize bfloat16_type_node if
	BFmode is supported.
	* expmed.h (maybe_expand_shift): Declare.
	* expmed.cc (maybe_expand_shift): No longer static.
	(emit_store_flag_1): Don't consider [BH]Fmode as wider mode to
	narrower modes.
	* expr.cc (convert_mode_scalar): Don't ICE on BF -> HF or HF -> BF
	conversions.  If there is no optab, handle BF -> {DF,XF,TF,HF}
	conversions as separate BF -> SF -> {DF,XF,TF,HF} conversions, add
	-ffast-math generic implementation for BF -> SF and SF -> BF
	conversions.
	* builtin-types.def (BT_BFLOAT16, BT_FN_BFLOAT16,
	BT_FN_BFLOAT16_BFLOAT16, BT_FN_BFLOAT16_CONST_STRING,
	BT_FN_BFLOAT16_BFLOAT16_BFLOAT16,
	BT_FN_BFLOAT16_BFLOAT16_BFLOAT16_BFLOAT16): New.
	* builtins.def (DEF_GCC_FLOATN_NX_BUILTINS,
	DEF_EXT_LIB_FLOATN_NX_BUILTINS): Also add *bf16 suffixed builtins,
	but for these only __builtin_ prefixed functions.
	* optabs.cc (can_compare_p, prepare_cmp_insn): Don't consider
	[BH]Fmode as wider mode to narrower modes.
	* config/i386/i386.cc (classify_argument): Handle E_BCmode.
	(ix86_libgcc_floating_mode_supported_p): Also return true for BFmode
	for -msse2.
	(ix86_mangle_type): Mangle BFmode as DF16b.
	(ix86_invalid_conversion, ix86_invalid_unary_op,
	ix86_invalid_binary_op): Remove.
	(TARGET_INVALID_CONVERSION, TARGET_INVALID_UNARY_OP,
	TARGET_INVALID_BINARY_OP): Don't redefine.
	* config/i386/i386-builtins.cc (ix86_bf16_type_node): Remove.
	(ix86_register_bf16_builtin_type): Use bfloat16_type_node rather than
	ix86_bf16_type_node, only create it if still NULL.
	* config/i386/i386-builtin-types.def (BFLOAT16): Likewise.
	* config/i386/i386.md (cbranchbf4, cstorebf4): New expanders.
gcc/c-family/
	* c-cppbuiltin.cc (c_cpp_builtins): If bfloat16_type_node,
	predefine for C++ __BFLT16_*__ macros and for C++23 also
	__STDCPP_BFLOAT16_T__.
	* c-lex.cc (interpret_float): Handle CPP_N_BFLOAT16 for C++.
gcc/c/
	* c-typeck.cc (convert_arguments): Don't promote __bf16 to
	double.
gcc/cp/
	* cp-tree.h (extended_float_type_p): Return true for
	bfloat16_type_node.
	* typeck.cc (cp_compare_floating_point_conversion_ranks): Set
	extended{1,2} if mv{1,2} is bfloat16_type_node.  Adjust comment.
gcc/testsuite/
	* lib/target-supports.exp (check_effective_target_bfloat16,
	check_effective_target_bfloat16_runtime, add_options_for_bfloat16):
	New.
	* gcc.dg/torture/bfloat16-basic.c: New test.
	* gcc.dg/torture/bfloat16-builtin.c: New test.
	* gcc.dg/torture/bfloat16-builtin-issignaling-1.c: New test.
	* gcc.dg/torture/bfloat16-complex.c: New test.
	* gcc.dg/torture/builtin-issignaling-1.c: Allow to be includable
	from bfloat16-builtin-issignaling-1.c.
	* gcc.dg/torture/floatn-basic.h: Allow to be includable from
	bfloat16-basic.c.
	* gcc.dg/torture/floatn-builtin.h: Allow to be includable from
	bfloat16-builtin.c.
	* gcc.target/i386/vect-bfloat16-typecheck_2.c: Adjust expected
	diagnostics.
	* gcc.target/i386/sse2-bfloat16-scalar-typecheck.c: Likewise.
	* gcc.target/i386/vect-bfloat16-typecheck_1.c: Likewise.
	* g++.target/i386/bfloat_cpp_typecheck.C: Likewise.
libcpp/
	* include/cpplib.h (CPP_N_BFLOAT16): Define.
	* expr.cc (interpret_float_suffix): Handle bf16 and BF16 suffixes for
	C++.
libgcc/
	* config/i386/t-softfp (softfp_extensions): Add bfsf.
	(softfp_truncations): Add tfbf xfbf dfbf sfbf hfbf.
	(CFLAGS-extendbfsf2.c, CFLAGS-truncsfbf2.c, CFLAGS-truncdfbf2.c,
	CFLAGS-truncxfbf2.c, CFLAGS-trunctfbf2.c, CFLAGS-trunchfbf2.c): Add
	-msse2.
	* config/i386/libgcc-glibc.ver (GCC_13.0.0): Export
	__extendbfsf2 and __trunc{s,d,x,t,h}fbf2.
	* config/i386/sfp-machine.h (_FP_NANSIGN_B): Define.
	* config/i386/64/sfp-machine.h (_FP_NANFRAC_B): Define.
	* config/i386/32/sfp-machine.h (_FP_NANFRAC_B): Define.
	* soft-fp/brain.h: New file.
	* soft-fp/truncsfbf2.c: New file.
	* soft-fp/truncdfbf2.c: New file.
	* soft-fp/truncxfbf2.c: New file.
	* soft-fp/trunctfbf2.c: New file.
	* soft-fp/trunchfbf2.c: New file.
	* soft-fp/truncbfhf2.c: New file.
	* soft-fp/extendbfsf2.c: New file.
libiberty/
	* cp-demangle.h (D_BUILTIN_TYPE_COUNT): Increment.
	* cp-demangle.c (cplus_demangle_builtin_types): Add std::bfloat16_t
	entry.
	(cplus_demangle_type): Demangle DF16b.
	* testsuite/demangle-expected (_Z3xxxDF16b): New test.

	Jakub

Message ID	Yzv3kyZFBYlJpeyL@tucnak
State	New, archived
Headers	Return-Path: <gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org> Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:4ac7:0:0:0:0:0 with SMTP id y7csp30747wrs; Tue, 4 Oct 2022 02:07:52 -0700 (PDT) X-Google-Smtp-Source: AMsMyM4cxFosue+BAszlYfoecCVQhVpCEAbWGHQZUFD4V1TQPgsobJtkhfGH1KZTu3X3PsVQdSEA X-Received: by 2002:a17:907:3f27:b0:78a:feb2:7f56 with SMTP id hq39-20020a1709073f2700b0078afeb27f56mr7782935ejc.295.1664874472294; Tue, 04 Oct 2022 02:07:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1664874472; cv=none; d=google.com; s=arc-20160816; b=e9Yec9yJJAR9ifsc0pQKAegkpFPK5EzS6CO0YdHa0Qu/LBQUwzhRCNi1ZCIZ2Ba27D p+u9yXWjAM2j6GYFN4a5lQmmtUQJZPcTJzvj3xYO6BYPMtF03f86rsQXFOZpbzHxFTTX 6EMJGoQ5ysUFUwW7PWqjuX70vjVhuxfhQkbdi/3Za8aXNBenl/zkV+bXsEhXT4x74Dv7 aPXlZFGg/Hu5+kaJAw8cfyQiRuR3rZT4g9F+8SvR4hXrEva+yi1YVPrc9tdAQYrDf2d/ VW7IUW1DRCbp/bnTnwfA6voFLTj5n6IiTmOYNNbzxWIlezh0FV+p9GYDV7/HqiScadQm /Ieg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:cc:reply-to:from:list-subscribe:list-help :list-post:list-archive:list-unsubscribe:list-id:precedence :content-disposition:in-reply-to:mime-version:references:message-id :subject:to:date:dmarc-filter:delivered-to:dkim-signature :dkim-filter; bh=KjQv5esuRBtmgbwWdA/jzRakrf15de1D7xju0KeN4I0=; b=QMqBT8eyYWBRA6Fv6/+uWd8gj7p3Lu99V0ONE1+Ji3gXhcJz/Pcmu/DjpjHs2lCT8X +J/Do1Om8CmYrkjnTEzPLLvxyDaSZgaWPBXxYgQCbn7MaljOsveSqk4j0q7sjiklSpwC 8LuRiDss2NbOPqpvrue1FgF345H37WjNNLpL7jIUUXpQOE5XBmLoSe4eFHy0yqHkIA5U 0hxI2uHAGjh1slHLFcRTlwv5kgzFZxo+i24mRBdsxVR7w3BQ/nUtcKF85bidda9cVQG8 qMZtOnQpjRigL5PwoTuH6Ja72BcmJFtAeXDuccNN+hD7b2tu4j/V4prv92BKUuSYQ/Ah wXAA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gcc.gnu.org header.s=default header.b=Wlwh8noL; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gnu.org Received: from sourceware.org (server2.sourceware.org. [2620:52:3:1:0:246e:9693:128c]) by mx.google.com with ESMTPS id ho36-20020a1709070ea400b007811ace1701si12845525ejc.445.2022.10.04.02.07.51 for <ouuuleilei@gmail.com> (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 04 Oct 2022 02:07:52 -0700 (PDT) Received-SPF: pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) client-ip=2620:52:3:1:0:246e:9693:128c; Authentication-Results: mx.google.com; dkim=pass header.i=@gcc.gnu.org header.s=default header.b=Wlwh8noL; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gnu.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 71F5D385843D for <ouuuleilei@gmail.com>; Tue, 4 Oct 2022 09:07:49 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 71F5D385843D DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1664874469; bh=KjQv5esuRBtmgbwWdA/jzRakrf15de1D7xju0KeN4I0=; h=Date:To:Subject:References:In-Reply-To:List-Id:List-Unsubscribe: List-Archive:List-Post:List-Help:List-Subscribe:From:Reply-To:Cc: From; b=Wlwh8noLYjh40TmoHc3RsAP9Q80K14dsx7IvvSQUboQlkYNTgf6PPIZXxzd/d3gO9 iBgSit/KYFeR97/vb7kWDe9LggW98pXZdII7cs9M9pb1Vi+imtytxVrFnquxs5ync+ +mi0OqvfgFldTlQ4Pt+w+6Rke/XDcuUsinwu6ZVE= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by sourceware.org (Postfix) with ESMTPS id BEDC53858D1E for <gcc-patches@gcc.gnu.org>; Tue, 4 Oct 2022 09:06:41 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org BEDC53858D1E Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-498-CYpczokMOomoyqj9hdKhfQ-1; Tue, 04 Oct 2022 05:06:38 -0400 X-MC-Unique: CYpczokMOomoyqj9hdKhfQ-1 Received: from smtp.corp.redhat.com (int-mx03.intmail.prod.int.rdu2.redhat.com [10.11.54.3]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 75C51855420; Tue, 4 Oct 2022 09:06:38 +0000 (UTC) Received: from tucnak.zalov.cz (unknown [10.39.192.194]) by smtp.corp.redhat.com (Postfix) with ESMTPS id B86B71121314; Tue, 4 Oct 2022 09:06:37 +0000 (UTC) Received: from tucnak.zalov.cz (localhost [127.0.0.1]) by tucnak.zalov.cz (8.17.1/8.17.1) with ESMTPS id 29496TM43990264 (version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NOT); Tue, 4 Oct 2022 11:06:29 +0200 Received: (from jakub@localhost) by tucnak.zalov.cz (8.17.1/8.17.1/Submit) id 29496RGL3990263; Tue, 4 Oct 2022 11:06:27 +0200 Date: Tue, 4 Oct 2022 11:06:27 +0200 To: Jason Merrill <jason@redhat.com>, "Joseph S. Myers" <joseph@codesourcery.com>, Richard Biener <rguenther@suse.de>, Jeff Law <jeffreyalaw@gmail.com>, Uros Bizjak <ubizjak@gmail.com> Subject: [PATCH] middle-end, c++, i386, libgcc: std::bfloat16_t and __bf16 arithmetic support Message-ID: <Yzv3kyZFBYlJpeyL@tucnak> References: <YzXABvJX2wl3gHkK@tucnak> <37522634-319a-b471-aa35-87e711b0479e@redhat.com> <Yzb4SikTcfSimsIn@tucnak> MIME-Version: 1.0 In-Reply-To: <Yzb4SikTcfSimsIn@tucnak> X-Scanned-By: MIMEDefang 3.1 on 10.11.54.3 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: text/plain; charset=us-ascii Content-Disposition: inline X-Spam-Status: No, score=-3.6 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_NONE, TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list <gcc-patches.gcc.gnu.org> List-Unsubscribe: <https://gcc.gnu.org/mailman/options/gcc-patches>, <mailto:gcc-patches-request@gcc.gnu.org?subject=unsubscribe> List-Archive: <https://gcc.gnu.org/pipermail/gcc-patches/> List-Post: <mailto:gcc-patches@gcc.gnu.org> List-Help: <mailto:gcc-patches-request@gcc.gnu.org?subject=help> List-Subscribe: <https://gcc.gnu.org/mailman/listinfo/gcc-patches>, <mailto:gcc-patches-request@gcc.gnu.org?subject=subscribe> From: Jakub Jelinek via Gcc-patches <gcc-patches@gcc.gnu.org> Reply-To: Jakub Jelinek <jakub@redhat.com> Cc: gcc-patches@gcc.gnu.org Errors-To: gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org Sender: "Gcc-patches" <gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org> X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1745747414292971238?= X-GMAIL-MSGID: =?utf-8?q?1745747414292971238?=
Series	middle-end, c++, i386, libgcc: std::bfloat16_t and __bf16 arithmetic support \| middle-end, c++, i386, libgcc: std::bfloat16_t and __bf16 arithmetic support

middle-end, c++, i386, libgcc: std::bfloat16_t and __bf16 arithmetic support

Commit Message

Comments

Patch