diff mbox series

[committed] amdgcn: add fmin/fmax patterns

Message ID	500fa1bc-9f12-c29e-e377-8b728727cf3b@codesourcery.com
State	Unresolved
Headers	Received-SPF: pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) client-ip=8.43.85.97; DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org F1A1C385DC00 IronPort-SDR: 6+ET40xsCiAEGoKT2qWzj920eY22agvwhuZqLnMyGEobwuCIL0i85T8KRzyAa4ywoptRgVR7U5 yezjrRZFU96FMbea8Yr5vUocLhzB9TxHmo4ef4gd/wKavfOkHhQbfX5K8/Yln68Ty44NP9SfMH 2RbAzCzcGCZdc6F/R9baTHuhtrferEFShURxaq6scNatEAW0OdqiQOSQq4UthHSxQjXBdbKfCE L7WYA3Qo2TJhy3f5SKOzbSEMds2E1js0vwuzFcihbEvmXUSf9ORTLiL5zRgOyFruPZO1wgWwKp Kp4= Content-Type: multipart/mixed; boundary="------------icEotwdfoSKFSDrowcgUQaLA" Message-ID: <500fa1bc-9f12-c29e-e377-8b728727cf3b@codesourcery.com> Date: Mon, 31 Oct 2022 13:03:13 +0000 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.4.1 Content-Language: en-GB From: Andrew Stubbs <ams@codesourcery.com> Subject: [committed] amdgcn: add fmin/fmax patterns To: "gcc-patches@gcc.gnu.org" <gcc-patches@gcc.gnu.org> Precedence: list Errors-To: gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org Sender: "Gcc-patches" <gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org> X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?=
Series	[committed] amdgcn: add fmin/fmax patterns \| [committed] amdgcn: add fmin/fmax patterns

Checks

Context	Check	Description
snail/gcc-patch-check	warning	Git am fail log

Commit Message

Andrew Stubbs Oct. 31, 2022, 1:03 p.m. UTC

  This patch adds patterns for the fmin and fmax operators, for scalars, 
vectors, and vector reductions.

The compiler uses smin and smax for most floating-point optimizations, 
etc., but not where the user calls fmin/fmax explicitly.  On amdgcn the 
hardware min/max instructions are already IEEE compliant w.r.t. 
unordered values, so there's no need for separate implementations.

Andrew
amdgcn: add fmin/fmax patterns

Add fmin/fmax for scalar, vector, and reductions.  The smin/smax patterns are
already using the IEEE compliant hardware instructions anyway, so we can just
expand to use those insns.

gcc/ChangeLog:

	* config/gcn/gcn-valu.md (fminmaxop): New iterator.
	(<fexpander><mode>3): New define_expand.
	(<fexpander><mode>3<exec>): Likewise.
	(reduc_<fexpander>_scal_<mode>): Likewise.
	* config/gcn/gcn.md (fexpander): New attribute.

diff mbox series

Patch

diff --git a/gcc/config/gcn/gcn-valu.md b/gcc/config/gcn/gcn-valu.md
index 6274d2e9228..3b619512e13 100644
--- a/gcc/config/gcn/gcn-valu.md
+++ b/gcc/config/gcn/gcn-valu.md
@@ -2466,6 +2466,23 @@  (define_insn "<expander><mode>3"
   [(set_attr "type" "vop2")
    (set_attr "length" "8,8")])
 
+(define_code_iterator fminmaxop [smin smax])
+(define_expand "<fexpander><mode>3"
+  [(set (match_operand:FP 0 "gcn_valu_dst_operand")
+	(fminmaxop:FP
+	  (match_operand:FP 1 "gcn_valu_src0_operand")
+	  (match_operand:FP 2 "gcn_valu_src1_operand")))]
+  ""
+  {})
+
+(define_expand "<fexpander><mode>3<exec>"
+  [(set (match_operand:V_FP 0 "gcn_valu_dst_operand")
+	(fminmaxop:V_FP
+	  (match_operand:V_FP 1 "gcn_valu_src0_operand")
+	  (match_operand:V_FP 2 "gcn_valu_src1_operand")))]
+  ""
+  {})
+
 ;; }}}
 ;; {{{ FP unops
 
@@ -3522,6 +3539,17 @@  (define_expand "reduc_<reduc_op>_scal_<mode>"
     DONE;
   })
 
+(define_expand "reduc_<fexpander>_scal_<mode>"
+  [(match_operand:<SCALAR_MODE> 0 "register_operand")
+   (fminmaxop:V_FP
+     (match_operand:V_FP 1 "register_operand"))]
+  ""
+  {
+    /* fmin/fmax are identical to smin/smax.  */
+    emit_insn (gen_reduc_<expander>_scal_<mode> (operands[0], operands[1]));
+    DONE;
+  })
+
 ;; Warning: This "-ffast-math" implementation converts in-order reductions
 ;;          into associative reductions. It's also used where OpenMP or
 ;;          OpenACC paralellization has already broken the in-order semantics.
diff --git a/gcc/config/gcn/gcn.md b/gcc/config/gcn/gcn.md
index 6c1a438f9d1..987b76396cc 100644
--- a/gcc/config/gcn/gcn.md
+++ b/gcc/config/gcn/gcn.md
@@ -372,6 +372,10 @@  (define_code_attr expander
    (sign_extend "extend")
    (zero_extend "zero_extend")])
 
+(define_code_attr fexpander
+  [(smin "fmin")
+   (smax "fmax")])
+
 ;; }}}
 ;; {{{ Miscellaneous instructions