From patchwork Tue Aug 9 13:23:47 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Andrew Stubbs X-Patchwork-Id: 12 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a05:6a10:20da:b0:2d3:3019:e567 with SMTP id n26csp2515214pxc; Tue, 9 Aug 2022 06:24:42 -0700 (PDT) X-Google-Smtp-Source: AA6agR7VQSMRXzgPR6s8tLLlR0O5/nhRcOOVKjyysO8MidmUOxOqUA2cDv+QDgxeC/Z/S8KMEZTK X-Received: by 2002:a17:907:75d5:b0:730:8baf:b314 with SMTP id jl21-20020a17090775d500b007308bafb314mr16703911ejc.587.1660051482774; Tue, 09 Aug 2022 06:24:42 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1660051482; cv=none; d=google.com; s=arc-20160816; b=LS51DHLBm3RHZ2K2+1r+uIWU6DRYLnMaJ9RE0u4g4y8PJJM8kyf6r5E3OUC6bUh1vC huI9uLKC2HFEj8OtJ77fxLG7yyh5GgDevwZk/dNMw1c5ZVxG8orcY6dXzxN4KnLrN0ch 0dZIl4xaNWkx7YVo0KhY5wn371us7WuenOv1fZBOSbyUGmQTyD9BS247VM/LyJkKbFse 4GtLsXRPJN624hW6Y/pd1pBD7aS1vJcrJIzUUciXP2UwspQNq9DfGyTyAlsKTahmYqjp fl6wmL/KzMmT+7zj7BnvtoYLDCYDmcAo0VcN3HAy88biSeDnnGiSQopiUUTsC9JFFlkO dFMw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:list-subscribe:list-help:list-post:list-archive :list-unsubscribe:list-id:precedence:content-transfer-encoding :mime-version:message-id:date:subject:to:from:ironport-sdr :dmarc-filter:delivered-to; bh=1h8vzXLb0qmIiPJ3UDNauSdZg9F0iuJsNe8D0auB3zU=; b=ibqg2vRyLxpf9FJWlMeHMZo/CBB5G8nbK7F5i9lolCcOigxvBhwRwMKPfxbimF9hjG oFXaaw7SI2mYsStXo2h8Mth5fZrfgCM7e6b1M0Oa8GkpGRGxZOmu3MbDSTelfftrG/4i fhiE0yuCxm7uOt8d9j2pNFqbdRpw4yvRHA0OqOWxU1SUVpBZLJqiDj1BiThqd3PS6K47 FlTu+gezo7XxPEeie1/1qdektrNL0aPROSVRf3XwrFqvoVVvrxBbPNzw7H+CoQA39Ku9 RJlkxH/IOfG0TBL8TlfjkckcUb4JYGLFeXoYCeh/YG61oLINid62V6rpXM4Vw2ACcigT 1FGg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org" Received: from sourceware.org (ip-8-43-85-97.sourceware.org. [8.43.85.97]) by mx.google.com with ESMTPS id js17-20020a17090797d100b00711fa454fb3si1894907ejc.889.2022.08.09.06.24.42 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 09 Aug 2022 06:24:42 -0700 (PDT) Received-SPF: pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) client-ip=8.43.85.97; Authentication-Results: mx.google.com; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org" Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 197DF3856962 for ; Tue, 9 Aug 2022 13:24:33 +0000 (GMT) X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from esa2.mentor.iphmx.com (esa2.mentor.iphmx.com [68.232.141.98]) by sourceware.org (Postfix) with ESMTPS id 9528A385702C for ; Tue, 9 Aug 2022 13:24:09 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 9528A385702C Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=codesourcery.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=mentor.com X-IronPort-AV: E=Sophos;i="5.93,224,1654588800"; d="scan'208";a="80994417" Received: from orw-gwy-01-in.mentorg.com ([192.94.38.165]) by esa2.mentor.iphmx.com with ESMTP; 09 Aug 2022 05:24:07 -0800 IronPort-SDR: 6hdRL9QwpQN4bUj39WK0cDU2qxaMmvkkDSLBn+Jg9ddxUbf00HINORRaG/VAx4MYIatX9aPRx5 w4AaCHcuIipBKKY6CgkprQX5JLf43s4330cjqXLhgDmyg5l0eqfzrWDkGtu06/5KNgfw6S03IS XvzB9nQTyP+bGtgtnAqplBlTK4iccgAMB/yCIdevANLn5EFTH6MoQGZeOnEZktZ14Wl1mATyPq 39hEvgm4jEyK0snQMlR5I6MSLYo6ICZqXXEyNCTk4ZK3T0Ul4L+6k/Dexa4hoA5WvAdJmrehD2 LRM= From: Andrew Stubbs To: Subject: [PATCH 0/3] OpenMP SIMD routines Date: Tue, 9 Aug 2022 14:23:47 +0100 Message-ID: X-Mailer: git-send-email 2.37.0 MIME-Version: 1.0 X-Originating-IP: [137.202.0.90] X-ClientProxiedBy: svr-ies-mbx-15.mgc.mentorg.com (139.181.222.15) To svr-ies-mbx-11.mgc.mentorg.com (139.181.222.11) X-Spam-Status: No, score=-5.5 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS, KAM_DMARC_STATUS, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org Sender: "Gcc-patches" X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1740690143747426054?= X-GMAIL-MSGID: =?utf-8?q?1740690143747426054?= This patch series implements OpenMP "simd" routines for amdgcn, and also adds support for "simd inbranch" routines for amdgcn, x86_64, and aarch64 (probably, I can't easily test it). I can approve patch 2 myself, but it depends on patch 1 so I include it here for context and completeness. I first tried to use "mask_mode = DImode", for amdgcn, but that does not produce great results because it ends up generating code to turn the mask into a vector and then back into the exact same mask, so I have settled on "mask_mode = VOIDmode", for now (in fact that uses fewer argument registers in many cases, so maybe it's better anyway). Additionally, I find that the x86_64 truth vectors cannot always be converted to the mask types specified by the backend, so I have pulled that code out completely. Therefore, this patch includes only "mask_mode == VOIDmode" support, but remains a step forward towards full SIMD clone support. I have not included dump-scans in the testcases for aarch64, but the testcases will still test correctness. The aarch64 maintainers can very easily add those scans if they choose. No other architecture has backend support for the clones at this time. OK for mainline (patches 1 & 3)? Thanks Andrew Andrew Stubbs (3): omp-simd-clone: Allow fixed-lane vectors amdgcn: OpenMP SIMD routine support vect: inbranch SIMD clones gcc/config/gcn/gcn.cc | 63 ++++++++ gcc/doc/tm.texi | 3 + gcc/omp-simd-clone.cc | 21 ++- gcc/target.def | 3 + gcc/testsuite/gcc.dg/vect/vect-simd-clone-1.c | 2 + .../gcc.dg/vect/vect-simd-clone-16.c | 89 ++++++++++++ .../gcc.dg/vect/vect-simd-clone-16b.c | 14 ++ .../gcc.dg/vect/vect-simd-clone-16c.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-16d.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-16e.c | 14 ++ .../gcc.dg/vect/vect-simd-clone-16f.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-17.c | 89 ++++++++++++ .../gcc.dg/vect/vect-simd-clone-17b.c | 14 ++ .../gcc.dg/vect/vect-simd-clone-17c.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-17d.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-17e.c | 14 ++ .../gcc.dg/vect/vect-simd-clone-17f.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-18.c | 89 ++++++++++++ .../gcc.dg/vect/vect-simd-clone-18b.c | 14 ++ .../gcc.dg/vect/vect-simd-clone-18c.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-18d.c | 16 +++ .../gcc.dg/vect/vect-simd-clone-18e.c | 14 ++ .../gcc.dg/vect/vect-simd-clone-18f.c | 16 +++ gcc/testsuite/gcc.dg/vect/vect-simd-clone-2.c | 2 + gcc/testsuite/gcc.dg/vect/vect-simd-clone-3.c | 1 + gcc/testsuite/gcc.dg/vect/vect-simd-clone-4.c | 1 + gcc/testsuite/gcc.dg/vect/vect-simd-clone-5.c | 1 + gcc/testsuite/gcc.dg/vect/vect-simd-clone-8.c | 2 + gcc/tree-if-conv.cc | 39 ++++- gcc/tree-vect-stmts.cc | 134 ++++++++++++++---- 30 files changed, 734 insertions(+), 33 deletions(-) create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-16.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-16b.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-16c.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-16d.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-16e.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-16f.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-17.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-17b.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-17c.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-17d.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-17e.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-17f.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-18.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-18b.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-18c.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-18d.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-18e.c create mode 100644 gcc/testsuite/gcc.dg/vect/vect-simd-clone-18f.c