From patchwork Tue Feb 7 08:36:06 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jakub Jelinek X-Patchwork-Id: 53775 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:adf:eb09:0:0:0:0:0 with SMTP id s9csp2724835wrn; Tue, 7 Feb 2023 00:37:16 -0800 (PST) X-Google-Smtp-Source: AK7set8KhckkOTBAMyGAX1vmPtiH3VZmwLD4sW8Wp8tn50ovBO25R6yPUaCI1+JyVrUpxOQW4dcR X-Received: by 2002:a50:cc95:0:b0:48f:68b1:db4e with SMTP id q21-20020a50cc95000000b0048f68b1db4emr2483518edi.27.1675759036801; Tue, 07 Feb 2023 00:37:16 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1675759036; cv=none; d=google.com; s=arc-20160816; b=bjc5CCI1a2Z3CUvqxkfSv8itlqPDZQnAKYTxWe/r3CdLeLAsxg1F8ZIrHLrVZSF5wx a2p9IsXMOWl8Ek4brLrg/Aiuabn0xVGWi4E1inYRy3B6h0VPPbjaPjPZ8lERY6wR9D0l o+c5tk/8ZMKgH2ZHyqLTlaX4XujSM+9COMgbGqkRpKWZgCpb2LV62xxhbgjvA5+xR/qv BJXLfEKzo/yhgLLO2WF57maoKa1fnxSCdK5gGh3mkddHKw+k1/b+eFJ+bL0NK2n+qtqT Hywx9ZbyaHSYDkoqYHM+CfuQJ+trexuny/WMBm7aUN0vc+EUL90tdH4MOMU3CezPV0Vj Ppig== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:reply-to:from:list-subscribe:list-help:list-post :list-archive:list-unsubscribe:list-id:precedence :content-disposition:mime-version:message-id:subject:cc:to:date :dmarc-filter:delivered-to:dkim-signature:dkim-filter; bh=V4KuCd8i/fBQk01XFW/pde9YmtXgObIpQgTX7tpuZ+g=; b=eRImCurfAlziVEYpbbGFIZ/nwqsXOwHYelH253lofXaoFVb0HYcfPmuWfNv52g1Aai 7CZJSz1LUtJvjgRUItxtzFhE9UEwvtHbqnGRWhCd4iQhmN6XVn+M1ZnU5f+P+YAV3s6s yIfKbiyh51RbfIeHdnT8W4gUVoFBW/QngzseoSkmB5pw6lWczooqPu5KqhPTwaJIIdEl vMa1goUVmFFUddchsJL+1EqdzcPdRIdga+Ygh1VHla3qgoNfezTLBG1xteeUuzz+bj2G K1eHBxeq1w9yGYZA4GCtYZe1Q82tH/mPDU7IA9LI19VxTbraelK5tvzE+SgMJt3/qjvM 2frQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gcc.gnu.org header.s=default header.b=XszMztUr; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gnu.org Received: from sourceware.org (server2.sourceware.org. [8.43.85.97]) by mx.google.com with ESMTPS id v17-20020aa7d9d1000000b004a0ddcc314esi12499348eds.248.2023.02.07.00.37.16 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 07 Feb 2023 00:37:16 -0800 (PST) Received-SPF: pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) client-ip=8.43.85.97; Authentication-Results: mx.google.com; dkim=pass header.i=@gcc.gnu.org header.s=default header.b=XszMztUr; spf=pass (google.com: domain of gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gnu.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id C74913858C33 for ; Tue, 7 Feb 2023 08:37:15 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org C74913858C33 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1675759035; bh=V4KuCd8i/fBQk01XFW/pde9YmtXgObIpQgTX7tpuZ+g=; h=Date:To:Cc:Subject:List-Id:List-Unsubscribe:List-Archive: List-Post:List-Help:List-Subscribe:From:Reply-To:From; b=XszMztUrw7AFUNtfh0NvbgN3lHlP/ep9awOnnLtSXeR1z3Xxb4zZF1s57cEjpT6Bm yXWIDnPnLfaX07ZZvJ/s8Jev38z1RyibjyAxKw0CQfWJfefrj0Rw777V7FnpOmBbni wYkhG1+Jt/CpezuRXuF7Z6O//JPeUM9x+TgQkEVg= X-Original-To: gcc-patches@gcc.gnu.org Delivered-To: gcc-patches@gcc.gnu.org Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by sourceware.org (Postfix) with ESMTPS id 609303858D1E for ; Tue, 7 Feb 2023 08:36:15 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 609303858D1E Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-226-zrEAnnxFP3m4fZO0LjbgFQ-1; Tue, 07 Feb 2023 03:36:11 -0500 X-MC-Unique: zrEAnnxFP3m4fZO0LjbgFQ-1 Received: from smtp.corp.redhat.com (int-mx01.intmail.prod.int.rdu2.redhat.com [10.11.54.1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 6DC5B80D0EF; Tue, 7 Feb 2023 08:36:11 +0000 (UTC) Received: from tucnak.zalov.cz (unknown [10.39.192.223]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 26F8F401014C; Tue, 7 Feb 2023 08:36:11 +0000 (UTC) Received: from tucnak.zalov.cz (localhost [127.0.0.1]) by tucnak.zalov.cz (8.17.1/8.17.1) with ESMTPS id 3178a8IH957479 (version=TLSv1.3 cipher=TLS_AES_256_GCM_SHA384 bits=256 verify=NOT); Tue, 7 Feb 2023 09:36:08 +0100 Received: (from jakub@localhost) by tucnak.zalov.cz (8.17.1/8.17.1/Submit) id 3178a7v2957476; Tue, 7 Feb 2023 09:36:07 +0100 Date: Tue, 7 Feb 2023 09:36:06 +0100 To: Jan Hubicka , Richard Biener Cc: gcc-patches@gcc.gnu.org Subject: [PATCH] cgraph: Handle simd clones in cgraph_node::set_{const,pure}_flag [PR106433] Message-ID: MIME-Version: 1.0 X-Scanned-By: MIMEDefang 3.1 on 10.11.54.1 X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Disposition: inline X-Spam-Status: No, score=-3.5 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_NONE, SPF_NONE, TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: gcc-patches@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gcc-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jakub Jelinek via Gcc-patches From: Jakub Jelinek Reply-To: Jakub Jelinek Errors-To: gcc-patches-bounces+ouuuleilei=gmail.com@gcc.gnu.org Sender: "Gcc-patches" X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1757160707560146533?= X-GMAIL-MSGID: =?utf-8?q?1757160707560146533?= Hi! The following testcase ICEs, because we determine only in late pure const pass that bar is const (the content of the function loses a store to a global var during dse3 and read from it during cddce2) and local-pure-const2 makes it const. The cgraph ordering is that post IPA (in late IPA simd clones are created) bar is processed first, then foo as its caller, then foo.simdclone* and finally bar.simdclone*. Conceptually I think that is the right ordering which allows for static simd clones to be removed. The reason for the ICE is that because bar was marked const, the call to it lost vops before vectorization, and when we in foo.simdclone* try to vectorize the call to bar, we replace it with bar.simdclone* which hasn't been marked const and so needs vops, which we don't add. Now, because the simd clones are created from the same IL, just in a loop with different argument/return value passing, I think generally if the base function is determined to be const or pure, the simd clones should be too, unless e.g. the vectorization causes different optimization decisions, but then still the global memory reads if any shouldn't affect what the function does and global memory stores shouldn't be reachable at runtime. So, the following patch changes set_{const,pure}_flag to mark also simd clones. Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk? 2023-02-07 Jakub Jelinek PR tree-optimization/106433 * cgraph.cc (set_const_flag_1): Recurse on simd clones too. (cgraph_node::set_pure_flag): Call set_pure_flag_1 on simd clones too. * gcc.c-torture/compile/pr106433.c: New test. Jakub --- gcc/cgraph.cc.jj 2023-02-02 10:54:44.327473492 +0100 +++ gcc/cgraph.cc 2023-02-06 12:28:22.040593063 +0100 @@ -2764,6 +2764,9 @@ set_const_flag_1 (cgraph_node *node, boo if (!set_const || alias->get_availability () > AVAIL_INTERPOSABLE) set_const_flag_1 (alias, set_const, looping, changed); } + for (struct cgraph_node *n = node->simd_clones; n != NULL; + n = n->simdclone->next_clone) + set_const_flag_1 (n, set_const, looping, changed); for (cgraph_edge *e = node->callers; e; e = e->next_caller) if (e->caller->thunk && (!set_const || e->caller->get_availability () > AVAIL_INTERPOSABLE)) @@ -2876,6 +2879,9 @@ cgraph_node::set_pure_flag (bool pure, b { struct set_pure_flag_info info = {pure, looping, false}; call_for_symbol_thunks_and_aliases (set_pure_flag_1, &info, !pure, true); + for (struct cgraph_node *n = simd_clones; n != NULL; + n = n->simdclone->next_clone) + set_pure_flag_1 (n, &info); return info.changed; } --- gcc/testsuite/gcc.c-torture/compile/pr106433.c.jj 2023-02-06 12:37:26.963748811 +0100 +++ gcc/testsuite/gcc.c-torture/compile/pr106433.c 2023-02-06 12:37:06.631041918 +0100 @@ -0,0 +1,24 @@ +/* PR tree-optimization/106433 */ + +int m, *p; + +__attribute__ ((simd)) int +bar (int x) +{ + if (x) + { + if (m < 1) + for (m = 0; m < 1; ++m) + ++x; + p = &x; + for (;;) + ++m; + } + return 0; +} + +__attribute__ ((simd)) int +foo (int x) +{ + return bar (x); +}