From patchwork Wed Jul 19 03:28:22 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "Li, Pan2 via Gcc-patches" X-Patchwork-Id: 122366 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:c923:0:b0:3e4:2afc:c1 with SMTP id j3csp2166303vqt; Tue, 18 Jul 2023 20:29:21 -0700 (PDT) X-Google-Smtp-Source: APBJJlGUVWKNVfFMRG9ig0bIyjrCcbDegcBS1hu8q9SuNL6Sf0QJ7gtLCiH+i+u1BCJVNWHWYm7k X-Received: by 2002:a17:906:14e:b0:97b:6c24:8d2b with SMTP id 14-20020a170906014e00b0097b6c248d2bmr1524933ejh.55.1689737361664; Tue, 18 Jul 2023 20:29:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689737361; cv=none; d=google.com; s=arc-20160816; b=ubKmIf+6f1CBC6W85GxDNJ+N6AV1gLz9UnMFMBkv2w5x4FtK57a2PYcMQVF2m5FGpd JFqELaLL9hn2ey2xCBwGavDO3jdZ7HyLIYV9VMQnZKj0vF6ASxahsBeozftMJaJzu6N5 mkCp2B/bbt8OH7OttALguYYd8OpKVYHZFNoKwPhk6ge0Leh4t4l9nyDtrO54P+paWSpr JDYDWYyFsV/hSE87dSEXXHU0hsihdA1rlecmuONIA3FUqixivOtWdhQ0Ze5oqMR4Dl9q UuVilc7Aaea881zu1h7HT6iqW3yznCRsuvE9+rZJtrO5az77shbqSGRbDgjtcicX5gPP 1ozQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:from:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence :content-transfer-encoding:mime-version:message-id:date:subject:cc :to:dmarc-filter:delivered-to:dkim-signature:dkim-filter; bh=ZOXjsZtLiLTBIdinIUp4qVHACaIbQc1xiOC6tNvUO6Y=; fh=9EZcvfVI324lLsRm78AR5trG4A9hqVpSpoTtyXFX1N4=; b=EMPjZhj7xvkTCO3H33d51z0uE2NS2WNsnOCBJ2SEGnEtl/QtbIw0q3D9Rdc6IMJDWN XwVdKnTLwgsZlfaEU0CNDg6+M2ixniwxBwZzbOw90CF8496Slu0Y9koa9hX+AYq9VjNb He0KLbmJYmIuwltPI+Y3itCMBq5Klz9wgfLBZfF1Ep9WO8YdlkzudiHVQIazG09GXg0H z+55KTWoJgd19BARMBV/xkddgX09L5dXS1ku9Jr0HE54uq6YNyqtqyLFYq1JHXyR6qIZ gEXIh3oLL51LrAJu3uTDL3az1daXi8NUasOr2TLOmN7Kd8wU4lJTSz9uVrAvuuzbMZ1w gr2g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gcc.gnu.org header.s=default header.b=KUtGqTRH; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gnu.org Received: from server2.sourceware.org (server2.sourceware.org. [2620:52:3:1:0:246e:9693:128c]) by mx.google.com with ESMTPS id a3-20020a170906368300b00993ee77a3c4si2054283ejc.681.2023.07.18.20.29.21 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 18 Jul 2023 20:29:21 -0700 (PDT) Received-SPF: pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) client-ip=2620:52:3:1:0:246e:9693:128c; Authentication-Results: mx.google.com; dkim=pass header.i=@gcc.gnu.org header.s=default header.b=KUtGqTRH; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gnu.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 632D7385AFAC for ; Wed, 19 Jul 2023 03:29:20 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 632D7385AFAC DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1689737360; bh=ZOXjsZtLiLTBIdinIUp4qVHACaIbQc1xiOC6tNvUO6Y=; h=To:Cc:Subject:Date:List-Id:List-Unsubscribe:List-Archive: List-Post:List-Help:List-Subscribe:From:Reply-To:From; b=KUtGqTRHNwM8Ng2w67Z7MMIurN4uSeRL8o7cdhrM3JBe2vwsaAq7+NCSEJKjjL1nm Yk9+EfFeEAX4FugfvVWoWlOOjFsZeR11ySn1u3jGGllrU/wKnyxuifPYji+f3Ye4fB O1eNg0RQbvMQTjp05Hs/mGLzE4mgfYWETCUbpP40= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from mga02.intel.com (mga02.intel.com [134.134.136.20]) by sourceware.org (Postfix) with ESMTPS id 2AF223858C74 for ; Wed, 19 Jul 2023 03:28:32 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 2AF223858C74 X-IronPort-AV: E=McAfee;i="6600,9927,10775"; a="356309745" X-IronPort-AV: E=Sophos;i="6.01,215,1684825200"; d="scan'208";a="356309745" Received: from orsmga002.jf.intel.com ([10.7.209.21]) by orsmga101.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 18 Jul 2023 20:28:30 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10775"; a="723851683" X-IronPort-AV: E=Sophos;i="6.01,215,1684825200"; d="scan'208";a="723851683" Received: from shvmail03.sh.intel.com ([10.239.245.20]) by orsmga002.jf.intel.com with ESMTP; 18 Jul 2023 20:28:24 -0700 Received: from pli-ubuntu.sh.intel.com (pli-ubuntu.sh.intel.com [10.239.159.47]) by shvmail03.sh.intel.com (Postfix) with ESMTP id C98BE1007BD7; Wed, 19 Jul 2023 11:28:23 +0800 (CST) To: gcc-patches@gcc.gnu.org Cc: juzhe.zhong@rivai.ai, pan2.li@intel.com, yanzhang.wang@intel.com, kito.cheng@gmail.com Subject: [PATCH v1] RISC-V: Support CALL for RVV floating-point dynamic rounding Date: Wed, 19 Jul 2023 11:28:22 +0800 Message-Id: <20230719032822.85817-1-pan2.li@intel.com> X-Mailer: git-send-email 2.34.1 MIME-Version: 1.0 X-Spam-Status: No, score=-11.0 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, GIT_PATCH_0, KAM_SHORT, RCVD_IN_MSPIKE_H3, RCVD_IN_MSPIKE_WL, SPF_HELO_NONE, SPF_NONE, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Pan Li via Gcc-patches From: "Li, Pan2 via Gcc-patches" Reply-To: pan2.li@intel.com Errors-To: gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org Sender: "Gcc-patches" X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1771818043706323395 X-GMAIL-MSGID: 1771818043706323395 From: Pan Li In basic dynamic rounding mode, we simply ignore call instructions and we would like to take care of call in this PATCH. During the call, the frm may be updated or keep as is. Thus, we must make sure at least 2 things. 1. The static frm before call should not pollute the frm value in call. 2. The updated frm value in call should be sticky after call completed. We will perfrom some steps to make above happen. 1. Mark call instruction with new mode DYN_CALL. 2. Mark the instruction after CALL from NONE to DYN. 3. When emit for a DYN_CALL, we will restore the frm value. 4. When emit from a DYN_CALL, we will backup the frm value. Let's take a flow for this. +-------------+ | Entry (DYN) | <- frrm a5 +-------------+ / \ +-------+ +-----------+ | VFADD | | VFADD RTZ | <- fsrmi 1(RTZ) +-------+ +-----------+ | | +-------+ +-----------+ | CALL | | CALL | <- fsrm a5 +-------+ +-----------+ | | +-----------+ +-------+ | SHIFT | <- frrm a5 | VFADD | <- frrm a5 +-----------+ +-------+ | / +-----------+ / | VFADD RUP | <- fsrm1 3(RUP) +-----------+ / \ / +-----------------+ | Exit (DYN_EXIT) | <- fsrm a5 +-----------------+ Please *NOTE* some corn cases like no instruction after a call is not well handled, and will be coverred in another PATCH(s) soon. Signed-off-by: Pan Li Co-Authored-By: Juzhe-Zhong gcc/ChangeLog: * config/riscv/riscv.cc (struct machine_function): Add new field static_frm_p. (riscv_emit_frm_mode_set): Add DYN_CALL emit. (riscv_frm_mode_needed): New function for frm mode needed. (riscv_mode_needed): Extrac function for frm. (riscv_frm_mode_after): Add DYN_CALL after. * config/riscv/vector.md (frm_mode): Add dyn_call. (fsrmsi_restore_exit): Rename to _volatile. (fsrmsi_restore_volatile): Likewise. gcc/testsuite/ChangeLog: * gcc.target/riscv/rvv/base/float-point-frm-insert-7.c: Fix tests cases. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-33.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-34.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-35.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-36.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-37.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-38.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-39.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-40.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-41.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-42.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-43.c: New test. * gcc.target/riscv/rvv/base/float-point-dynamic-frm-44.c: New test. * gcc.target/riscv/rvv/base/float-point-frm-run-4.c: New test. * gcc.target/riscv/rvv/base/float-point-frm-run-5.c: New test. Signed-off-by: Pan Li --- gcc/config/riscv/riscv.cc | 73 +++++++++++++++- gcc/config/riscv/vector.md | 4 +- .../rvv/base/float-point-dynamic-frm-33.c | 31 +++++++ .../rvv/base/float-point-dynamic-frm-34.c | 32 +++++++ .../rvv/base/float-point-dynamic-frm-35.c | 32 +++++++ .../rvv/base/float-point-dynamic-frm-36.c | 29 +++++++ .../rvv/base/float-point-dynamic-frm-37.c | 36 ++++++++ .../rvv/base/float-point-dynamic-frm-38.c | 34 ++++++++ .../rvv/base/float-point-dynamic-frm-39.c | 36 ++++++++ .../rvv/base/float-point-dynamic-frm-40.c | 34 ++++++++ .../rvv/base/float-point-dynamic-frm-41.c | 37 +++++++++ .../rvv/base/float-point-dynamic-frm-42.c | 37 +++++++++ .../rvv/base/float-point-dynamic-frm-43.c | 38 +++++++++ .../rvv/base/float-point-dynamic-frm-44.c | 40 +++++++++ .../riscv/rvv/base/float-point-frm-insert-7.c | 5 +- .../riscv/rvv/base/float-point-frm-run-4.c | 82 ++++++++++++++++++ .../riscv/rvv/base/float-point-frm-run-5.c | 83 +++++++++++++++++++ 17 files changed, 655 insertions(+), 8 deletions(-) create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-33.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-34.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-35.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-36.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-37.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-38.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-39.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-40.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-41.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-42.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-43.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-44.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-4.c create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-5.c diff --git a/gcc/config/riscv/riscv.cc b/gcc/config/riscv/riscv.cc index a45c52a2437..e65520ee8a2 100644 --- a/gcc/config/riscv/riscv.cc +++ b/gcc/config/riscv/riscv.cc @@ -151,6 +151,10 @@ struct GTY(()) machine_function { /* The RTL variable which stores the dynamic FRM value. We always use this RTX to restore dynamic FRM rounding mode in mode switching. */ rtx dynamic_frm; + + /* The boolean variables indicates there is at least one static rounding + mode instruction in the function or not. */ + bool static_frm_p; }; /* Information about a single argument. */ @@ -7669,9 +7673,13 @@ riscv_static_frm_mode_p (int mode) static void riscv_emit_frm_mode_set (int mode, int prev_mode) { + rtx backup_reg = cfun->machine->dynamic_frm; + + if (prev_mode == FRM_MODE_DYN_CALL) + emit_insn (gen_frrmsi (backup_reg)); /* Backup frm when DYN_CALL. */ + if (mode != prev_mode) { - rtx backup_reg = cfun->machine->dynamic_frm; /* TODO: By design, FRM_MODE_xxx used by mode switch which is different from the FRM value like FRM_RTZ defined in riscv-protos.h. When mode switching we actually need a conversion @@ -7681,9 +7689,13 @@ riscv_emit_frm_mode_set (int mode, int prev_mode) and then we leverage this assumption when emit. */ rtx frm = gen_int_mode (mode, SImode); - if (mode == FRM_MODE_DYN_EXIT && prev_mode != FRM_MODE_DYN) + if (mode == FRM_MODE_DYN_CALL && prev_mode != FRM_MODE_DYN) /* No need to emit when prev mode is DYN already. */ - emit_insn (gen_fsrmsi_restore_exit (backup_reg)); + emit_insn (gen_fsrmsi_restore_volatile (backup_reg)); + else if (mode == FRM_MODE_DYN_EXIT && prev_mode != FRM_MODE_DYN + && prev_mode != FRM_MODE_DYN_CALL && cfun->machine->static_frm_p) + /* No need to emit when prev mode is DYN or DYN_CALL already. */ + emit_insn (gen_fsrmsi_restore_volatile (backup_reg)); else if (mode == FRM_MODE_DYN) /* Restore frm value from backup when switch to DYN mode. */ emit_insn (gen_fsrmsi_restore (backup_reg)); @@ -7713,6 +7725,53 @@ riscv_emit_mode_set (int entity, int mode, int prev_mode, } } +/* Return mode that frm must be switched into + prior to the execution of insn. */ + +static int +riscv_frm_mode_needed (rtx_insn *cur_insn, int code) +{ + if (CALL_P (cur_insn)) + return FRM_MODE_DYN_CALL; + + int mode = code >= 0 ? get_attr_frm_mode (cur_insn) : FRM_MODE_NONE; + + if (mode == FRM_MODE_NONE) + { + /* After meet a call, we need to backup the frm because it may be + updated during the call. Here, for each insn, we will check if + the previous insn is a call or not. When previous insn is call, + there will be 2 cases for the emit mode set. + + 1. Current insn is not MODE_NONE, then the mode switch framework + will do the mode switch from MODE_CALL to MODE_NON_NONE natively. + 2. Current insn is MODE_NONE, we need to adjust the MODE_NONE to + the MODE_DYN, and leave the mode switch itself to perform + the emit mode set. + + TODO: this cannot handle one case if there is no instruction + after a call, we will take care of it soon. + */ + rtx_insn *insn; + basic_block bb = BLOCK_FOR_INSN (cur_insn); + + for (insn = PREV_INSN (cur_insn); insn; insn = PREV_INSN (insn)) + { + if (INSN_P (insn)) + { + if (CALL_P (insn)) + mode = FRM_MODE_DYN; + break; + } + + if (insn == PREV_INSN (BB_HEAD (bb))) + break; + } + } + + return mode; +} + /* Return mode that entity must be switched into prior to the execution of insn. */ @@ -7726,7 +7785,7 @@ riscv_mode_needed (int entity, rtx_insn *insn) case RISCV_VXRM: return code >= 0 ? get_attr_vxrm_mode (insn) : VXRM_MODE_NONE; case RISCV_FRM: - return code >= 0 ? get_attr_frm_mode (insn) : FRM_MODE_NONE; + return riscv_frm_mode_needed (insn, code); default: gcc_unreachable (); } @@ -7803,6 +7862,12 @@ riscv_vxrm_mode_after (rtx_insn *insn, int mode) static int riscv_frm_mode_after (rtx_insn *insn, int mode) { + cfun->machine->static_frm_p = cfun->machine->static_frm_p + || riscv_static_frm_mode_p (mode); + + if (CALL_P (insn)) + return FRM_MODE_DYN_CALL; + if (frm_unknown_dynamic_p (insn)) return FRM_MODE_DYN; diff --git a/gcc/config/riscv/vector.md b/gcc/config/riscv/vector.md index 215ecb9cb58..8b354e593ce 100644 --- a/gcc/config/riscv/vector.md +++ b/gcc/config/riscv/vector.md @@ -475,7 +475,7 @@ (define_attr "vxrm_mode" "rnu,rne,rdn,rod,none" ) ;; Defines rounding mode of an floating-point operation. -(define_attr "frm_mode" "rne,rtz,rdn,rup,rmm,dyn,dyn_exit,none" +(define_attr "frm_mode" "rne,rtz,rdn,rup,rmm,dyn,dyn_exit,dyn_call,none" (cond [ (eq_attr "type" "vfalu") @@ -610,7 +610,7 @@ (define_insn "fsrmsi_restore" ;; The volatile fsrmsi restore is used for the exit point for the ;; dynamic mode switching. It will generate one volatile fsrm a5 ;; which won't be eliminated. -(define_insn "fsrmsi_restore_exit" +(define_insn "fsrmsi_restore_volatile" [(set (reg:SI FRM_REGNUM) (unspec_volatile:SI [(match_operand:SI 0 "register_operand" "r")] UNSPECV_FRM_RESTORE_EXIT))] diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-33.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-33.c new file mode 100644 index 00000000000..4bd520ea2af --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-33.c @@ -0,0 +1,31 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + vl = normalize_vl (vl); + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 2, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-34.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-34.c new file mode 100644 index 00000000000..6c7cf7ef69c --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-34.c @@ -0,0 +1,32 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 2, vl); + + vl = normalize_vl (vl); + + return vl > 128 ? result : op2; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 1 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 1 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-35.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-35.c new file mode 100644 index 00000000000..b7f5a6919f4 --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-35.c @@ -0,0 +1,32 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 2, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 1 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 1 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-36.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-36.c new file mode 100644 index 00000000000..4485cea24d9 --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-36.c @@ -0,0 +1,29 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + if (vl % 4 != 0) + vl = normalize_vl (vl); + + return vl > 16 ? result : op2; +} + +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[01234]} } } */ +/* { dg-final { scan-assembler-not {fsrm\s+[axs][0-9]+} } } */ +/* { dg-final { scan-assembler-not {frrm\s+[axs][0-9]+} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-37.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-37.c new file mode 100644 index 00000000000..a1fca1a2a3f --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-37.c @@ -0,0 +1,36 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, + int count, size_t vl) +{ + vfloat32m1_t result = op1; + + for (int i = 0; i < count; i++) + { + if (i % 2 == 0) + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + else + vl = normalize_vl (vl); + } + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 1 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 1 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-38.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-38.c new file mode 100644 index 00000000000..8d59cae9a87 --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-38.c @@ -0,0 +1,34 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, + int count, size_t vl) +{ + vfloat32m1_t result = op1; + + for (int i = 0; i < count; i++) + { + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + vl = normalize_vl (vl); + } + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 1 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 1 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 1 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-39.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-39.c new file mode 100644 index 00000000000..04c54877393 --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-39.c @@ -0,0 +1,36 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, + int count, size_t vl) +{ + vfloat32m1_t result = op1; + + for (int i = 0; i < count; i++) + { + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + if (vl % 8 != 0) + vl = normalize_vl (vl); + } + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 1 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-40.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-40.c new file mode 100644 index 00000000000..49cf52f739b --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-40.c @@ -0,0 +1,34 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + vl = normalize_vl (vl); + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 2, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-41.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-41.c new file mode 100644 index 00000000000..79ef55b2c9f --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-41.c @@ -0,0 +1,37 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 4, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 3 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-42.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-42.c new file mode 100644 index 00000000000..d2a17ad715f --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-42.c @@ -0,0 +1,37 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1 (result, op2, vl); + + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 2, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-43.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-43.c new file mode 100644 index 00000000000..50e1da2c3c1 --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-43.c @@ -0,0 +1,38 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + vl = normalize_vl (vl); + + if (vl % 16 == 0) + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 1, vl); + + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1 (op1, result, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-44.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-44.c new file mode 100644 index 00000000000..a66ca89308b --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-dynamic-frm-44.c @@ -0,0 +1,40 @@ +/* { dg-do compile } */ +/* { dg-options "-march=rv64gcv -mabi=lp64 -O3 -Wno-psabi" } */ + +#include "riscv_vector.h" + +size_t __attribute__ ((noinline)) +normalize_vl (size_t vl) +{ + if (vl % 4 == 0) + return vl; + + return ((vl / 4) + 1) * 4; +} + +vfloat32m1_t +test_float_point_dynamic_frm (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op1; + + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 4, vl); + + vl = normalize_vl (vl); + + if (vl % 16 == 0) + result = __riscv_vfadd_vv_f32m1_rm (result, op2, 1, vl); + + + if (vl % 7 != 0) + vl = normalize_vl (vl); + + result = __riscv_vfadd_vv_f32m1 (op1, result, vl); + + return result; +} + +/* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 3 } } */ +/* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 4 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-insert-7.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-insert-7.c index 12db112dd0b..6de9d06b875 100644 --- a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-insert-7.c +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-insert-7.c @@ -26,6 +26,7 @@ test_float_point_frm_static (float *out, vfloat32m1_t op1, vfloat32m1_t op2, } /* { dg-final { scan-assembler-times {vfadd\.v[vf]\s+v[0-9]+,\s*v[0-9]+,\s*[fav]+[0-9]+} 2 } } */ -/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 1 } } */ -/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 1 } } */ +/* { dg-final { scan-assembler-times {frrm\s+[axs][0-9]+} 2 } } */ +/* { dg-final { scan-assembler-times {fsrm\s+[axs][0-9]+} 2 } } */ /* { dg-final { scan-assembler-times {fsrmi\s+[01234]} 2 } } */ +/* { dg-final { scan-assembler-not {fsrmi\s+[axs][0-9]+,\s*[01234]} } } */ diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-4.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-4.c new file mode 100644 index 00000000000..5796aa53a73 --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-4.c @@ -0,0 +1,82 @@ +/* { dg-do run { target { riscv_vector } } } */ +/* { dg-options "-O3 -Wno-psabi" } */ + +#include "riscv_vector.h" +#include +#include + +#define ORIGINAL_FRM 1 + +static int +get_frm () +{ + int frm = -1; + + __asm__ volatile ( + "frrm %0" + :"=r"(frm) + : + : + ); + + return frm; +} + +static void +set_frm (int frm) +{ + __asm__ volatile ( + "fsrm %0" + : + :"r"(frm) + : + ); +} + +static inline void +assert_equal (int a, int b, char *message) +{ + if (a != b) + { + fprintf (stdout, "%s, but get %d != %d\n", message, a, b); + __builtin_abort (); + } +} + +vfloat32m1_t __attribute__ ((noinline)) +other_function (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = op2; + + result = __riscv_vfadd_vv_f32m1 (op1, result, vl); + + assert_equal (ORIGINAL_FRM, get_frm (), + "The value of frm register should be ORIGINAL_FRM."); + + return result; +} + +vfloat32m1_t __attribute__ ((noinline)) +test_float_point_frm_run (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = {}; + + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 4, vl); + + assert_equal (4, get_frm (), "The value of frm register should be 4."); + + return other_function (result, op2, vl); +} + +int +main () +{ + size_t vl = 8; + vfloat32m1_t op1 = {}; + vfloat32m1_t op2 = {}; + + set_frm (ORIGINAL_FRM); + test_float_point_frm_run (op1, op2, vl); + + return 0; +} diff --git a/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-5.c b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-5.c new file mode 100644 index 00000000000..208a65fcd3a --- /dev/null +++ b/gcc/testsuite/gcc.target/riscv/rvv/base/float-point-frm-run-5.c @@ -0,0 +1,83 @@ +/* { dg-do run { target { riscv_vector } } } */ +/* { dg-options "-O3 -Wno-psabi" } */ + +#include "riscv_vector.h" +#include +#include + +#define ORIGINAL_FRM 1 +#define NEW_FRM 4 + +static int +get_frm () +{ + int frm = -1; + + __asm__ volatile ( + "frrm %0" + :"=r"(frm) + : + : + ); + + return frm; +} + +static void +set_frm (int frm) +{ + __asm__ volatile ( + "fsrm %0" + : + :"r"(frm) + : + ); +} + +static inline void +assert_equal (int a, int b, char *message) +{ + if (a != b) + { + fprintf (stdout, "%s, but get %d != %d\n", message, a, b); + __builtin_abort (); + } +} + +void __attribute__ ((noinline)) +other_function () +{ + set_frm (NEW_FRM); +} + +vfloat32m1_t __attribute__ ((noinline)) +test_float_point_frm_run (vfloat32m1_t op1, vfloat32m1_t op2, size_t vl) +{ + vfloat32m1_t result = {}; + + other_function (); + assert_equal (NEW_FRM, get_frm (), + "The value of frm register should be NEW_FRM."); + + result = __riscv_vfadd_vv_f32m1_rm (op1, result, 2, vl); + assert_equal (2, get_frm (), "The value of frm register should be 2."); + + result = __riscv_vfadd_vv_f32m1 (op1, result, vl); + assert_equal (NEW_FRM, get_frm (), + "The value of frm register should be NEW_FRM."); + + return result; +} + +int +main () +{ + size_t vl = 8; + vfloat32m1_t op1 = {}; + vfloat32m1_t op2 = {}; + + set_frm (ORIGINAL_FRM); + test_float_point_frm_run (op1, op2, vl); + + return 0; +}