From patchwork Fri Jun 16 07:32:40 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Jan Beulich X-Patchwork-Id: 108900 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a59:994d:0:b0:3d9:f83d:47d9 with SMTP id k13csp1149765vqr; Fri, 16 Jun 2023 00:33:50 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ48g0uf8sQgWTmWx5jXbc5OA+KFlRSnm3GC9gnIYBg5XgBP7++efdyeAKYhvqQ0uXUkl8zd X-Received: by 2002:a17:907:2da4:b0:974:62d7:1467 with SMTP id gt36-20020a1709072da400b0097462d71467mr1102329ejc.5.1686900830176; Fri, 16 Jun 2023 00:33:50 -0700 (PDT) Received: from sourceware.org (ip-8-43-85-97.sourceware.org. [8.43.85.97]) by mx.google.com with ESMTPS id gg23-20020a170906e29700b009821ad6ad90si6045004ejb.555.2023.06.16.00.33.49 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 16 Jun 2023 00:33:50 -0700 (PDT) Received-SPF: pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 8.43.85.97 as permitted sender) client-ip=8.43.85.97; Authentication-Results: mx.google.com; dkim=pass header.i=@sourceware.org header.s=default header.b=oOQfeATM; arc=fail (signature failed); spf=pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="binutils-bounces+ouuuleilei=gmail.com@sourceware.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=sourceware.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id AC37E3853D1F for ; Fri, 16 Jun 2023 07:32:57 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org AC37E3853D1F DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1686900777; bh=9+Xx/2l1HF538RHagGizVVDbokFaQnqDRHkJAfCC33Y=; h=Date:Subject:To:Cc:References:In-Reply-To:List-Id: List-Unsubscribe:List-Archive:List-Post:List-Help:List-Subscribe: From:Reply-To:From; b=oOQfeATMyhYaklg7AfEg7x5pFBmJ63C/KKkQuKzGpUOJJVZ3X2FaegEeKQldE3fXT UUYAQUL5WknM1z4km7R2KLVYTSvf2MHxykR1LmggfkK2Sh7MJwpce8aw5ggJ3VRcH0 1CeEKZRq2TJ1Uymbsf/ClXXUFyupHXZYOtwwHowA= X-Original-To: binutils@sourceware.org Delivered-To: binutils@sourceware.org Received: from EUR03-DBA-obe.outbound.protection.outlook.com (mail-dbaeur03on2058.outbound.protection.outlook.com [40.107.104.58]) by sourceware.org (Postfix) with ESMTPS id 65AD638313A6 for ; Fri, 16 Jun 2023 07:32:45 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 65AD638313A6 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=K7LH0zO57G+6YjbVrqufKNSoCx1ETR+BtGhfdqoiSZERlRk/wJ7aQk+25yAa4sVVTses8LHr3WioqJ4wcxxh9csMbjy2KNZkKBNhHg+YJw/MugnwX4U7Pj2EkNIMs1EG5xttoUUsOsEyCL7hy2RmvjR7VrKMnG3yNTnORiOXPrEitFGqEnbnDNMObukVm7rQqtyV68U+CIrzdKXWYuN1n2gBaw1DC60HJO3h2kJZB8Nsl7BGj6mdp3DN+WbjznBVwiH6mTgUbV8DmtPaWleKUdEINt+3/XplFt+ayth51xdq7OhXbReKu8Ts65vfWjrG0vFui6f/LAi0nCyOXWM0rQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=9+Xx/2l1HF538RHagGizVVDbokFaQnqDRHkJAfCC33Y=; b=Zolb8RNCeayPRlX67+Q8fxKO+YwBD89YzkvE/v5sZU82Q+cKJHO4LW1+mCxKCsGfOuHsXXntCrkTAC3SLl1MUqexxm2uKVo5gmlXJGIOuEUYIhqD1g3vfigK9uCyZ3942MmVjRNTRoFgSiDmqrhlvmBiiUZCJavjOUugUUqyWg1Ikqxjb+CpaDUhFh/YZoBXPnzzrQhQnNOcrm+vC31qlnXDsK925ed03467PV1LcHrV29zVA/5mW9XwfsrTPWH3ttzTEmpxgg+wusyFg0W+/xqClG3olvpG+8X2li0IgqXeCnpLLw8WA7gWGy3c1voC+PcfcWBRRaHePvznE/auQA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none Received: from VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) by AM0PR04MB6913.eurprd04.prod.outlook.com (2603:10a6:208:184::24) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6500.29; Fri, 16 Jun 2023 07:32:42 +0000 Received: from VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c]) by VE1PR04MB6560.eurprd04.prod.outlook.com ([fe80::e442:306f:7711:e24c%5]) with mapi id 15.20.6455.039; Fri, 16 Jun 2023 07:32:42 +0000 Message-ID: <08bf9dc9-5616-7dce-a094-d2ea799c92bf@suse.com> Date: Fri, 16 Jun 2023 09:32:40 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.12.0 Subject: [PATCH 4/4] x86: provide a 128-bit VBROADCASTSD pseudo Content-Language: en-US To: Binutils Cc: "H.J. Lu" References: In-Reply-To: X-ClientProxiedBy: FR0P281CA0127.DEUP281.PROD.OUTLOOK.COM (2603:10a6:d10:97::8) To VE1PR04MB6560.eurprd04.prod.outlook.com (2603:10a6:803:122::25) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: VE1PR04MB6560:EE_|AM0PR04MB6913:EE_ X-MS-Office365-Filtering-Correlation-Id: ca9e4487-803f-4cc1-e7f2-08db6e3bde66 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: qpDxDtzMBZDKMm5h5dR9nzUBzW+UiqtODKX9vCUZ5B1WvSLLnTGe6IPl8Orq7Cs0N5W2DzhnVk06SRWCtI5PScGlyAOSf5izFgTSVRc5kQObU7YNlmxdTsmoAfwvQs1PeJrA03WlYvzXLdVaQ7e5UtIAm1LlONp5aIquzk73D21seTtzxmxXTTr31yrzUNvZJf58Trc1YgJVGHi5HU8zfcz2TS/4AlcZKNjF62vY51vIqvptngFlB8cK+r4YLfQqfWyRNasvNy/41+kZgU/VzRVFglDC13x3qDazmB/dNM3Bcgqlu6s+cXMcaHteRTk2L2MKhOeyfL6McDCmBiD3igRMQCMGIzFMoSAELVLnbaNvGKoPakm+e4yL/TW+GtdWj1Move2CpQAY4Lj2+hXm3ztmbm2X/CSYqQA8Afyme7zR7fYWEha8H3hLBzea2U36yyaQfc3AcstvkS4dKTd+jwtcnC5IrMd0lBCO6orJVjO9ffKgHr6mbhdvSvbcTUsWCYzX0WPRYZr57PGKofmwKhyYjpc/5+VDIbB/8VTV8ph7HGSxyTzMb5mR02MhS/RCC6iT/SJONgGA/HzsMP23ftIgGVZEOkF+dOvABz/4ibuFf5Ky5z5LgQhM41O8ZlvNn4mBlvBShvNkzBkjOoCy0g== X-Forefront-Antispam-Report: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR04MB6560.eurprd04.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230028)(376002)(396003)(366004)(136003)(346002)(39860400002)(451199021)(478600001)(6486002)(5660300002)(41300700001)(8936002)(8676002)(2906002)(30864003)(36756003)(86362001)(31696002)(38100700002)(66476007)(66556008)(66946007)(316002)(6916009)(4326008)(26005)(6512007)(6506007)(186003)(31686004)(83380400001)(2616005)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?gIpz98dlYnM85923ZdEPJsan5OKb?= =?utf-8?q?AQC2FLYtlqAFglZqmUg0ThwNeG4AJMFQLE0h0CQIjekkYEQu7wq4riq+v8Kt7ZuVT?= =?utf-8?q?Cutt7AMWNTiYJUjkOoPid5eSPccpDpFFtIIAx6DtCzOUV2dAYnPuqSCIFq8DjAbhG?= =?utf-8?q?d3I5kJAA+K6mRIyA/SE+mby7oYrMNEQ4/nSlPYgyHn3JSlvKqs+3iL05L4tZqq2qq?= =?utf-8?q?1CPOydlHkHngAvL5+7qZ61EO13rXziql0te95PbN5AIUrF68e98Z8AvMEsumjiR3h?= =?utf-8?q?eexa0qkqqnEbf2YTht8uZR0LVbCwtpVqssZzYqTv34B52ufBKTY7obJL4ks5o35XV?= =?utf-8?q?HEdv5Bq+85nXEr5FqXG/zMXURCcJXaCGOlXtEkZGAF+E2N1/ezSBkOIrKwDSgUIut?= =?utf-8?q?3tUdJB0vcJfz7w5ZZZJp5Aug/C19YjIrAXWit70GI9qEdiyyVj42jSbTTv0f6PGxc?= =?utf-8?q?p6sX9NGM0I1NXImV8a1xTBY7gCI5awE5ZuS4RZKnJ6pHCZGHZbVUH0cx47FUu7WBf?= =?utf-8?q?ydOoIQTu7JJuXi0fB7Mjvp5cZz/8om9D1qPLG2TdZ/qPXvCyX3JKW19QWx/8bv//9?= =?utf-8?q?Y8w81q3ZnN+YAM/HKZkrqZkOrbkuz0bm/8grnkHWSpTYON95kT67Ai49fP4vZIdNw?= =?utf-8?q?GK6oE1L3Lrx/nYhq5jjmj92KR2TFSONbUpmumHGcjd+aIm+ouSrXnLS/MPZkaZeEF?= =?utf-8?q?FSXyQV4OMCu0Ia0HGsZ6jch//WTBB1gViff/LvwogMANssaEkIydrsisYobWVJI4e?= =?utf-8?q?AxXnNLOKusl5Pw29ImWBnzULOnYZlmDjELTmyR6IEbgQOSz8iZSmBXQ242ISoWCz2?= =?utf-8?q?g5MBr4HF3sVGDn6D0zJzRj8XZsxtPOeQPXsNYKn5QU2BAReMlauzrhpTLw5Dfq1LH?= =?utf-8?q?8TG09EzHadAzlYZwEopdMp3eYJujdfgTRgOTXf+FuiGishwFM9yz/5L9dyBCDPTSp?= =?utf-8?q?eWFRU1HU8Ru3WLihPvC8qTeJZuKoxKCcA5PEKCmdcXAUYJ5AwdIwqwbtKvTHXTgwl?= =?utf-8?q?ecSH9VpUlAZHKr6/g53JF5dYTgxRgR6J/Nl/J6iXlfbER+gTKdc3nG3nHz06CnrsC?= =?utf-8?q?fgeq0nbINJC1z5PcWLWiLHV425cJvNzD5s2+pXS2pTq0OLGiuSStKo+fcWvMJEDXk?= =?utf-8?q?Y1bz0YQxO2thN+sqnC96nkHZc9thp80FcJAyWnV31YrPv2+VKbXdW6DqwQjy0dOp2?= =?utf-8?q?HtBS038fJVMDfxHXAz1mL2p005zJvZEvOzgyino+cgRn2U7hAeVwWZEfzvn2IYcLF?= =?utf-8?q?wW+Iyct8V72Znpr1fUmbDX/upSSsT+hZvE5gd3+h+pNu9bqvtElm4cgIedchouaMP?= =?utf-8?q?XCtrNXtsB/wYbIbG/CiowqNYNJ2Vxm27bXT+GtmbFwzJRt7rUilNdpiC3j+qliv/e?= =?utf-8?q?JX0PID9TJgB+Rfmz3jdgNP4b2Q6kqjQlvCa3XTIKLOLzbPU9g+f9Q+SDcEJD5HEFb?= =?utf-8?q?jFX1tfBmCOLT6cjq74t3kM+/7FioDdTsMjFlarbST2sskxRVBcFQI7YVmuTX0VSwo?= =?utf-8?q?I59OKDy0EYKH?= X-OriginatorOrg: suse.com X-MS-Exchange-CrossTenant-Network-Message-Id: ca9e4487-803f-4cc1-e7f2-08db6e3bde66 X-MS-Exchange-CrossTenant-AuthSource: VE1PR04MB6560.eurprd04.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 16 Jun 2023 07:32:42.3573 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: tKZsDJrzjDz9lGu0SZWF3V6pM4ZpuSjstxItVLYWjtF9P8JlpVEalMGQ5dOpgfEhfJcd6tY4Es3e8ALRd46b/w== X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR04MB6913 X-Spam-Status: No, score=-3027.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_NONE, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: binutils@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Binutils mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: Jan Beulich via Binutils From: Jan Beulich Reply-To: Jan Beulich Errors-To: binutils-bounces+ouuuleilei=gmail.com@sourceware.org Sender: "Binutils" X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1768843724728924657?= X-GMAIL-MSGID: =?utf-8?q?1768843724728924657?= VBROADCASTSD not supporting 128-bit destinations in any of their AVX, AVX2, or AVX512F incarnations is presumably because of VMOVDDUP precisely supporting this very operation. (It is therefore different from e.g. VPBROADCASTQ, which has no exact equivalent.) Still its absence has led to people using VPBROADCASTQ as substitution; this could have been avoided if such a pseudo had been supported from the very beginning. Note that the pseudos try to match what the real instructions would have used as closely as possible, i.e. VexW0 instead of VexWIG for the AVX and AVX2 forms as well as AVX2 in the first place for the register source form. --- For being the first example of us supplying such, this is partly RFC. On top of that a question is also whether to indeed have split AVX/AVX2 templates, when in principle one (allowing for both memory and register source) could do. --- a/gas/testsuite/gas/i386/avx.d +++ b/gas/testsuite/gas/i386/avx.d @@ -927,6 +927,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 35 21 vpmovzxdq \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd %xmm4,%xmm6 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd \(%ecx\),%xmm4 +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd %xmm4,\(%ecx\) [ ]*[a-f0-9]+: c5 f8 13 21 vmovlps %xmm4,\(%ecx\) @@ -2768,6 +2769,8 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd %xmm4,%xmm6 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd \(%ecx\),%xmm4 +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup \(%ecx\),%xmm4 +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd %xmm4,\(%ecx\) --- a/gas/testsuite/gas/i386/avx.s +++ b/gas/testsuite/gas/i386/avx.s @@ -982,6 +982,7 @@ _start: vucomisd (%ecx),%xmm4 # Tests for op mem64, xmm + vbroadcastsd (%ecx),%xmm4 vmovsd (%ecx),%xmm4 # Tests for op xmm, mem64 @@ -2953,6 +2954,8 @@ _start: vucomisd xmm4,[ecx] # Tests for op mem64, xmm + vbroadcastsd xmm4,QWORD PTR [ecx] + vbroadcastsd xmm4,[ecx] vmovsd xmm4,QWORD PTR [ecx] vmovsd xmm4,[ecx] --- a/gas/testsuite/gas/i386/avx-16bit.d +++ b/gas/testsuite/gas/i386/avx-16bit.d @@ -928,6 +928,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: 67 c4 e2 79 35 21 vpmovzxdq \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd %xmm4,%xmm6 [ ]*[a-f0-9]+: 67 c5 f9 2e 21 vucomisd \(%ecx\),%xmm4 +[ ]*[a-f0-9]+: 67 c5 fb 12 21 vmovddup \(%ecx\),%xmm4 [ ]*[a-f0-9]+: 67 c5 fb 10 21 vmovsd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: 67 c5 f9 13 21 vmovlpd %xmm4,\(%ecx\) [ ]*[a-f0-9]+: 67 c5 f8 13 21 vmovlps %xmm4,\(%ecx\) @@ -2769,6 +2770,8 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd %xmm4,%xmm6 [ ]*[a-f0-9]+: 67 c5 f9 2e 21 vucomisd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: 67 c5 f9 2e 21 vucomisd \(%ecx\),%xmm4 +[ ]*[a-f0-9]+: 67 c5 fb 12 21 vmovddup \(%ecx\),%xmm4 +[ ]*[a-f0-9]+: 67 c5 fb 12 21 vmovddup \(%ecx\),%xmm4 [ ]*[a-f0-9]+: 67 c5 fb 10 21 vmovsd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: 67 c5 fb 10 21 vmovsd \(%ecx\),%xmm4 [ ]*[a-f0-9]+: 67 c5 f9 13 21 vmovlpd %xmm4,\(%ecx\) --- a/gas/testsuite/gas/i386/avx-intel.d +++ b/gas/testsuite/gas/i386/avx-intel.d @@ -928,6 +928,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 35 21 vpmovzxdq xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd xmm6,xmm4 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd xmm4,QWORD PTR \[ecx\] +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd QWORD PTR \[ecx\],xmm4 [ ]*[a-f0-9]+: c5 f8 13 21 vmovlps QWORD PTR \[ecx\],xmm4 @@ -2769,6 +2770,8 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd xmm6,xmm4 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd xmm4,QWORD PTR \[ecx\] +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup xmm4,QWORD PTR \[ecx\] +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd xmm4,QWORD PTR \[ecx\] [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd QWORD PTR \[ecx\],xmm4 --- a/gas/testsuite/gas/i386/avx2.d +++ b/gas/testsuite/gas/i386/avx2.d @@ -73,6 +73,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 78 21 vpbroadcastb \(%ecx\),%xmm4 [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb %xmm4,%ymm6 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb \(%ecx\),%ymm4 +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup %xmm4,%xmm6 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss %xmm4,%xmm6 [ ]*[a-f0-9]+: c4 e2 5d 8c 31 vpmaskmovd \(%ecx\),%ymm4,%ymm6 [ ]*[a-f0-9]+: c4 e2 4d 8e 21 vpmaskmovd %ymm4,%ymm6,\(%ecx\) @@ -177,5 +178,6 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb %xmm4,%ymm6 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb \(%ecx\),%ymm4 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb \(%ecx\),%ymm4 +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup %xmm4,%xmm6 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss %xmm4,%xmm6 #pass --- a/gas/testsuite/gas/i386/avx2.s +++ b/gas/testsuite/gas/i386/avx2.s @@ -114,6 +114,7 @@ _start: vpbroadcastb (%ecx),%ymm4 # Tests for op xmm, xmm + vbroadcastsd %xmm4,%xmm6 vbroadcastss %xmm4,%xmm6 .intel_syntax noprefix @@ -265,4 +266,5 @@ _start: vpbroadcastb ymm4,[ecx] # Tests for op xmm, xmm + vbroadcastsd xmm6,xmm4 vbroadcastss xmm6,xmm4 --- a/gas/testsuite/gas/i386/avx2-intel.d +++ b/gas/testsuite/gas/i386/avx2-intel.d @@ -74,6 +74,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 78 21 vpbroadcastb xmm4,BYTE PTR \[ecx\] [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb ymm6,xmm4 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb ymm4,BYTE PTR \[ecx\] +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup xmm6,xmm4 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss xmm6,xmm4 [ ]*[a-f0-9]+: c4 e2 5d 8c 31 vpmaskmovd ymm6,ymm4,YMMWORD PTR \[ecx\] [ ]*[a-f0-9]+: c4 e2 4d 8e 21 vpmaskmovd YMMWORD PTR \[ecx\],ymm6,ymm4 @@ -178,5 +179,6 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb ymm6,xmm4 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb ymm4,BYTE PTR \[ecx\] [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb ymm4,BYTE PTR \[ecx\] +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup xmm6,xmm4 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss xmm6,xmm4 #pass --- a/gas/testsuite/gas/i386/avx512f_vl.d +++ b/gas/testsuite/gas/i386/avx512f_vl.d @@ -155,6 +155,15 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 00 08 00 00[ ]*vbroadcasti32x4 0x800\(%edx\),%ymm6\{%k7\} [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a 72 80[ ]*vbroadcasti32x4 -0x800\(%edx\),%ymm6\{%k7\} [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 -0x810\(%edx\),%ymm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 31[ ]*vmovddup \(%ecx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 31[ ]*vmovddup \(%ecx\),%xmm6\{%k7\}\{z\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b4 f4 c0 1d fe ff[ ]*vmovddup -0x1e240\(%esp,%esi,8\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 7f[ ]*vmovddup 0x3f8\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 00 04 00 00[ ]*vmovddup 0x400\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 80[ ]*vmovddup -0x400\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 f8 fb ff ff[ ]*vmovddup -0x408\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 f5[ ]*vmovddup %xmm5,%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 f5[ ]*vmovddup %xmm5,%xmm6\{%k7\}\{z\} [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 31[ ]*vbroadcastsd \(%ecx\),%ymm6\{%k7\} [ ]*[a-f0-9]+:[ ]*62 f2 fd af 19 31[ ]*vbroadcastsd \(%ecx\),%ymm6\{%k7\}\{z\} [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 b4 f4 c0 1d fe ff[ ]*vbroadcastsd -0x1e240\(%esp,%esi,8\),%ymm6\{%k7\} @@ -5850,6 +5859,15 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 00 08 00 00[ ]*vbroadcasti32x4 0x800\(%edx\),%ymm6\{%k7\} [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a 72 80[ ]*vbroadcasti32x4 -0x800\(%edx\),%ymm6\{%k7\} [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 -0x810\(%edx\),%ymm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 31[ ]*vmovddup \(%ecx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 31[ ]*vmovddup \(%ecx\),%xmm6\{%k7\}\{z\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b4 f4 c0 1d fe ff[ ]*vmovddup -0x1e240\(%esp,%esi,8\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 7f[ ]*vmovddup 0x3f8\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 00 04 00 00[ ]*vmovddup 0x400\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 80[ ]*vmovddup -0x400\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 f8 fb ff ff[ ]*vmovddup -0x408\(%edx\),%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 f5[ ]*vmovddup %xmm5,%xmm6\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 f5[ ]*vmovddup %xmm5,%xmm6\{%k7\}\{z\} [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 31[ ]*vbroadcastsd \(%ecx\),%ymm6\{%k7\} [ ]*[a-f0-9]+:[ ]*62 f2 fd af 19 31[ ]*vbroadcastsd \(%ecx\),%ymm6\{%k7\}\{z\} [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 b4 f4 c0 1d fe ff[ ]*vbroadcastsd -0x1e240\(%esp,%esi,8\),%ymm6\{%k7\} --- a/gas/testsuite/gas/i386/avx512f_vl.s +++ b/gas/testsuite/gas/i386/avx512f_vl.s @@ -149,6 +149,15 @@ _start: vbroadcasti32x4 2048(%edx), %ymm6{%k7} # AVX512{F,VL} vbroadcasti32x4 -2048(%edx), %ymm6{%k7} # AVX512{F,VL} Disp8 vbroadcasti32x4 -2064(%edx), %ymm6{%k7} # AVX512{F,VL} + vbroadcastsd (%ecx), %xmm6{%k7} # AVX512{F,VL} + vbroadcastsd (%ecx), %xmm6{%k7}{z} # AVX512{F,VL} + vbroadcastsd -123456(%esp,%esi,8), %xmm6{%k7} # AVX512{F,VL} + vbroadcastsd 1016(%edx), %xmm6{%k7} # AVX512{F,VL} Disp8 + vbroadcastsd 1024(%edx), %xmm6{%k7} # AVX512{F,VL} + vbroadcastsd -1024(%edx), %xmm6{%k7} # AVX512{F,VL} Disp8 + vbroadcastsd -1032(%edx), %xmm6{%k7} # AVX512{F,VL} + vbroadcastsd %xmm5, %xmm6{%k7} # AVX512{F,VL} + vbroadcastsd %xmm5, %xmm6{%k7}{z} # AVX512{F,VL} vbroadcastsd (%ecx), %ymm6{%k7} # AVX512{F,VL} vbroadcastsd (%ecx), %ymm6{%k7}{z} # AVX512{F,VL} vbroadcastsd -123456(%esp,%esi,8), %ymm6{%k7} # AVX512{F,VL} @@ -5846,6 +5855,15 @@ _start: vbroadcasti32x4 ymm6{k7}, XMMWORD PTR [edx+2048] # AVX512{F,VL} vbroadcasti32x4 ymm6{k7}, XMMWORD PTR [edx-2048] # AVX512{F,VL} Disp8 vbroadcasti32x4 ymm6{k7}, XMMWORD PTR [edx-2064] # AVX512{F,VL} + vbroadcastsd xmm6{k7}, QWORD PTR [ecx] # AVX512{F,VL} + vbroadcastsd xmm6{k7}{z}, QWORD PTR [ecx] # AVX512{F,VL} + vbroadcastsd xmm6{k7}, QWORD PTR [esp+esi*8-123456] # AVX512{F,VL} + vbroadcastsd xmm6{k7}, QWORD PTR [edx+1016] # AVX512{F,VL} Disp8 + vbroadcastsd xmm6{k7}, QWORD PTR [edx+1024] # AVX512{F,VL} + vbroadcastsd xmm6{k7}, QWORD PTR [edx-1024] # AVX512{F,VL} Disp8 + vbroadcastsd xmm6{k7}, QWORD PTR [edx-1032] # AVX512{F,VL} + vbroadcastsd xmm6{k7}, xmm5 # AVX512{F,VL} + vbroadcastsd xmm6{k7}{z}, xmm5 # AVX512{F,VL} vbroadcastsd ymm6{k7}, QWORD PTR [ecx] # AVX512{F,VL} vbroadcastsd ymm6{k7}{z}, QWORD PTR [ecx] # AVX512{F,VL} vbroadcastsd ymm6{k7}, QWORD PTR [esp+esi*8-123456] # AVX512{F,VL} --- a/gas/testsuite/gas/i386/avx512f_vl-intel.d +++ b/gas/testsuite/gas/i386/avx512f_vl-intel.d @@ -155,6 +155,15 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 00 08 00 00[ ]*vbroadcasti32x4 ymm6\{k7\},XMMWORD PTR \[edx\+0x800\] [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a 72 80[ ]*vbroadcasti32x4 ymm6\{k7\},XMMWORD PTR \[edx-0x800\] [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 ymm6\{k7\},XMMWORD PTR \[edx-0x810\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 31[ ]*vmovddup xmm6\{k7\},QWORD PTR \[ecx\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 31[ ]*vmovddup xmm6\{k7\}\{z\},QWORD PTR \[ecx\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b4 f4 c0 1d fe ff[ ]*vmovddup xmm6\{k7\},QWORD PTR \[esp\+esi\*8-0x1e240\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 7f[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx\+0x3f8\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 00 04 00 00[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx\+0x400\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 80[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx-0x400\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 f8 fb ff ff[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx-0x408\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 f5[ ]*vmovddup xmm6\{k7\},xmm5 +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 f5[ ]*vmovddup xmm6\{k7\}\{z\},xmm5 [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 31[ ]*vbroadcastsd ymm6\{k7\},QWORD PTR \[ecx\] [ ]*[a-f0-9]+:[ ]*62 f2 fd af 19 31[ ]*vbroadcastsd ymm6\{k7\}\{z\},QWORD PTR \[ecx\] [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 b4 f4 c0 1d fe ff[ ]*vbroadcastsd ymm6\{k7\},QWORD PTR \[esp\+esi\*8-0x1e240\] @@ -5850,6 +5859,15 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 00 08 00 00[ ]*vbroadcasti32x4 ymm6\{k7\},XMMWORD PTR \[edx\+0x800\] [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a 72 80[ ]*vbroadcasti32x4 ymm6\{k7\},XMMWORD PTR \[edx-0x800\] [ ]*[a-f0-9]+:[ ]*62 f2 7d 2f 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 ymm6\{k7\},XMMWORD PTR \[edx-0x810\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 31[ ]*vmovddup xmm6\{k7\},QWORD PTR \[ecx\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 31[ ]*vmovddup xmm6\{k7\}\{z\},QWORD PTR \[ecx\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b4 f4 c0 1d fe ff[ ]*vmovddup xmm6\{k7\},QWORD PTR \[esp\+esi\*8-0x1e240\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 7f[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx\+0x3f8\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 00 04 00 00[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx\+0x400\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 72 80[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx-0x400\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 b2 f8 fb ff ff[ ]*vmovddup xmm6\{k7\},QWORD PTR \[edx-0x408\] +[ ]*[a-f0-9]+:[ ]*62 f1 ff 0f 12 f5[ ]*vmovddup xmm6\{k7\},xmm5 +[ ]*[a-f0-9]+:[ ]*62 f1 ff 8f 12 f5[ ]*vmovddup xmm6\{k7\}\{z\},xmm5 [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 31[ ]*vbroadcastsd ymm6\{k7\},QWORD PTR \[ecx\] [ ]*[a-f0-9]+:[ ]*62 f2 fd af 19 31[ ]*vbroadcastsd ymm6\{k7\}\{z\},QWORD PTR \[ecx\] [ ]*[a-f0-9]+:[ ]*62 f2 fd 2f 19 b4 f4 c0 1d fe ff[ ]*vbroadcastsd ymm6\{k7\},QWORD PTR \[esp\+esi\*8-0x1e240\] --- a/gas/testsuite/gas/i386/x86-64-avx.d +++ b/gas/testsuite/gas/i386/x86-64-avx.d @@ -875,6 +875,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 35 21 vpmovzxdq \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd %xmm4,%xmm6 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd \(%rcx\),%xmm4 +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd %xmm4,\(%rcx\) [ ]*[a-f0-9]+: c5 f8 13 21 vmovlps %xmm4,\(%rcx\) @@ -2818,6 +2819,8 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd %xmm4,%xmm6 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd \(%rcx\),%xmm4 +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup \(%rcx\),%xmm4 +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd %xmm4,\(%rcx\) --- a/gas/testsuite/gas/i386/x86-64-avx.s +++ b/gas/testsuite/gas/i386/x86-64-avx.s @@ -930,6 +930,7 @@ _start: vucomisd (%rcx),%xmm4 # Tests for op mem64, xmm + vbroadcastsd (%rcx),%xmm4 vmovsd (%rcx),%xmm4 # Tests for op xmm, mem64 @@ -3024,6 +3025,8 @@ _start: vucomisd xmm4,[rcx] # Tests for op mem64, xmm + vbroadcastsd xmm4,QWORD PTR [rcx] + vbroadcastsd xmm4,[rcx] vmovsd xmm4,QWORD PTR [rcx] vmovsd xmm4,[rcx] --- a/gas/testsuite/gas/i386/x86-64-avx-intel.d +++ b/gas/testsuite/gas/i386/x86-64-avx-intel.d @@ -876,6 +876,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 35 21 vpmovzxdq xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd xmm6,xmm4 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd xmm4,QWORD PTR \[rcx\] +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd QWORD PTR \[rcx\],xmm4 [ ]*[a-f0-9]+: c5 f8 13 21 vmovlps QWORD PTR \[rcx\],xmm4 @@ -2819,6 +2820,8 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c5 f9 2e f4 vucomisd xmm6,xmm4 [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 f9 2e 21 vucomisd xmm4,QWORD PTR \[rcx\] +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup xmm4,QWORD PTR \[rcx\] +[ ]*[a-f0-9]+: c5 fb 12 21 vmovddup xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 fb 10 21 vmovsd xmm4,QWORD PTR \[rcx\] [ ]*[a-f0-9]+: c5 f9 13 21 vmovlpd QWORD PTR \[rcx\],xmm4 --- a/gas/testsuite/gas/i386/x86-64-avx2.d +++ b/gas/testsuite/gas/i386/x86-64-avx2.d @@ -73,6 +73,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 78 21 vpbroadcastb \(%rcx\),%xmm4 [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb %xmm4,%ymm6 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb \(%rcx\),%ymm4 +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup %xmm4,%xmm6 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss %xmm4,%xmm6 [ ]*[a-f0-9]+: c4 e2 5d 8c 31 vpmaskmovd \(%rcx\),%ymm4,%ymm6 [ ]*[a-f0-9]+: c4 e2 4d 8e 21 vpmaskmovd %ymm4,%ymm6,\(%rcx\) @@ -177,5 +178,6 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb %xmm4,%ymm6 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb \(%rcx\),%ymm4 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb \(%rcx\),%ymm4 +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup %xmm4,%xmm6 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss %xmm4,%xmm6 #pass --- a/gas/testsuite/gas/i386/x86-64-avx2.s +++ b/gas/testsuite/gas/i386/x86-64-avx2.s @@ -114,6 +114,7 @@ _start: vpbroadcastb (%rcx),%ymm4 # Tests for op xmm, xmm + vbroadcastsd %xmm4,%xmm6 vbroadcastss %xmm4,%xmm6 .intel_syntax noprefix @@ -265,4 +266,5 @@ _start: vpbroadcastb ymm4,[rcx] # Tests for op xmm, xmm + vbroadcastsd xmm6,xmm4 vbroadcastss xmm6,xmm4 --- a/gas/testsuite/gas/i386/x86-64-avx2-intel.d +++ b/gas/testsuite/gas/i386/x86-64-avx2-intel.d @@ -74,6 +74,7 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 79 78 21 vpbroadcastb xmm4,BYTE PTR \[rcx\] [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb ymm6,xmm4 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb ymm4,BYTE PTR \[rcx\] +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup xmm6,xmm4 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss xmm6,xmm4 [ ]*[a-f0-9]+: c4 e2 5d 8c 31 vpmaskmovd ymm6,ymm4,YMMWORD PTR \[rcx\] [ ]*[a-f0-9]+: c4 e2 4d 8e 21 vpmaskmovd YMMWORD PTR \[rcx\],ymm6,ymm4 @@ -178,5 +179,6 @@ Disassembly of section .text: [ ]*[a-f0-9]+: c4 e2 7d 78 f4 vpbroadcastb ymm6,xmm4 [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb ymm4,BYTE PTR \[rcx\] [ ]*[a-f0-9]+: c4 e2 7d 78 21 vpbroadcastb ymm4,BYTE PTR \[rcx\] +[ ]*[a-f0-9]+: c5 fb 12 f4 vmovddup xmm6,xmm4 [ ]*[a-f0-9]+: c4 e2 79 18 f4 vbroadcastss xmm6,xmm4 #pass --- a/gas/testsuite/gas/i386/x86-64-avx512f_vl.d +++ b/gas/testsuite/gas/i386/x86-64-avx512f_vl.d @@ -167,6 +167,17 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 00 08 00 00[ ]*vbroadcasti32x4 0x800\(%rdx\),%ymm30 [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a 72 80[ ]*vbroadcasti32x4 -0x800\(%rdx\),%ymm30 [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 -0x810\(%rdx\),%ymm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 31[ ]*vmovddup \(%rcx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 0f 12 31[ ]*vmovddup \(%rcx\),%xmm30\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 61 ff 8f 12 31[ ]*vmovddup \(%rcx\),%xmm30\{%k7\}\{z\} +[ ]*[a-f0-9]+:[ ]*62 21 ff 08 12 b4 f0 23 01 00 00[ ]*vmovddup 0x123\(%rax,%r14,8\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 7f[ ]*vmovddup 0x3f8\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 00 04 00 00[ ]*vmovddup 0x400\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 80[ ]*vmovddup -0x400\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 f8 fb ff ff[ ]*vmovddup -0x408\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 01 ff 08 12 f5[ ]*vmovddup %xmm29,%xmm30 +[ ]*[a-f0-9]+:[ ]*62 01 ff 0f 12 f5[ ]*vmovddup %xmm29,%xmm30\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 01 ff 8f 12 f5[ ]*vmovddup %xmm29,%xmm30\{%k7\}\{z\} [ ]*[a-f0-9]+:[ ]*62 62 fd 28 19 31[ ]*vbroadcastsd \(%rcx\),%ymm30 [ ]*[a-f0-9]+:[ ]*62 62 fd 2f 19 31[ ]*vbroadcastsd \(%rcx\),%ymm30\{%k7\} [ ]*[a-f0-9]+:[ ]*62 62 fd af 19 31[ ]*vbroadcastsd \(%rcx\),%ymm30\{%k7\}\{z\} @@ -6474,6 +6485,17 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 00 08 00 00[ ]*vbroadcasti32x4 0x800\(%rdx\),%ymm30 [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a 72 80[ ]*vbroadcasti32x4 -0x800\(%rdx\),%ymm30 [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 -0x810\(%rdx\),%ymm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 31[ ]*vmovddup \(%rcx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 0f 12 31[ ]*vmovddup \(%rcx\),%xmm30\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 61 ff 8f 12 31[ ]*vmovddup \(%rcx\),%xmm30\{%k7\}\{z\} +[ ]*[a-f0-9]+:[ ]*62 21 ff 08 12 b4 f0 34 12 00 00[ ]*vmovddup 0x1234\(%rax,%r14,8\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 7f[ ]*vmovddup 0x3f8\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 00 04 00 00[ ]*vmovddup 0x400\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 80[ ]*vmovddup -0x400\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 f8 fb ff ff[ ]*vmovddup -0x408\(%rdx\),%xmm30 +[ ]*[a-f0-9]+:[ ]*62 01 ff 08 12 f5[ ]*vmovddup %xmm29,%xmm30 +[ ]*[a-f0-9]+:[ ]*62 01 ff 0f 12 f5[ ]*vmovddup %xmm29,%xmm30\{%k7\} +[ ]*[a-f0-9]+:[ ]*62 01 ff 8f 12 f5[ ]*vmovddup %xmm29,%xmm30\{%k7\}\{z\} [ ]*[a-f0-9]+:[ ]*62 62 fd 28 19 31[ ]*vbroadcastsd \(%rcx\),%ymm30 [ ]*[a-f0-9]+:[ ]*62 62 fd 2f 19 31[ ]*vbroadcastsd \(%rcx\),%ymm30\{%k7\} [ ]*[a-f0-9]+:[ ]*62 62 fd af 19 31[ ]*vbroadcastsd \(%rcx\),%ymm30\{%k7\}\{z\} --- a/gas/testsuite/gas/i386/x86-64-avx512f_vl.s +++ b/gas/testsuite/gas/i386/x86-64-avx512f_vl.s @@ -161,6 +161,17 @@ _start: vbroadcasti32x4 2048(%rdx), %ymm30 # AVX512{F,VL} vbroadcasti32x4 -2048(%rdx), %ymm30 # AVX512{F,VL} Disp8 vbroadcasti32x4 -2064(%rdx), %ymm30 # AVX512{F,VL} + vbroadcastsd (%rcx), %xmm30 # AVX512{F,VL} + vbroadcastsd (%rcx), %xmm30{%k7} # AVX512{F,VL} + vbroadcastsd (%rcx), %xmm30{%k7}{z} # AVX512{F,VL} + vbroadcastsd 0x123(%rax,%r14,8), %xmm30 # AVX512{F,VL} + vbroadcastsd 1016(%rdx), %xmm30 # AVX512{F,VL} Disp8 + vbroadcastsd 1024(%rdx), %xmm30 # AVX512{F,VL} + vbroadcastsd -1024(%rdx), %xmm30 # AVX512{F,VL} Disp8 + vbroadcastsd -1032(%rdx), %xmm30 # AVX512{F,VL} + vbroadcastsd %xmm29, %xmm30 # AVX512{F,VL} + vbroadcastsd %xmm29, %xmm30{%k7} # AVX512{F,VL} + vbroadcastsd %xmm29, %xmm30{%k7}{z} # AVX512{F,VL} vbroadcastsd (%rcx), %ymm30 # AVX512{F,VL} vbroadcastsd (%rcx), %ymm30{%k7} # AVX512{F,VL} vbroadcastsd (%rcx), %ymm30{%k7}{z} # AVX512{F,VL} @@ -6470,6 +6481,17 @@ _start: vbroadcasti32x4 ymm30, XMMWORD PTR [rdx+2048] # AVX512{F,VL} vbroadcasti32x4 ymm30, XMMWORD PTR [rdx-2048] # AVX512{F,VL} Disp8 vbroadcasti32x4 ymm30, XMMWORD PTR [rdx-2064] # AVX512{F,VL} + vbroadcastsd xmm30, QWORD PTR [rcx] # AVX512{F,VL} + vbroadcastsd xmm30{k7}, QWORD PTR [rcx] # AVX512{F,VL} + vbroadcastsd xmm30{k7}{z}, QWORD PTR [rcx] # AVX512{F,VL} + vbroadcastsd xmm30, QWORD PTR [rax+r14*8+0x1234] # AVX512{F,VL} + vbroadcastsd xmm30, QWORD PTR [rdx+1016] # AVX512{F,VL} Disp8 + vbroadcastsd xmm30, QWORD PTR [rdx+1024] # AVX512{F,VL} + vbroadcastsd xmm30, QWORD PTR [rdx-1024] # AVX512{F,VL} Disp8 + vbroadcastsd xmm30, QWORD PTR [rdx-1032] # AVX512{F,VL} + vbroadcastsd xmm30, xmm29 # AVX512{F,VL} + vbroadcastsd xmm30{k7}, xmm29 # AVX512{F,VL} + vbroadcastsd xmm30{k7}{z}, xmm29 # AVX512{F,VL} vbroadcastsd ymm30, QWORD PTR [rcx] # AVX512{F,VL} vbroadcastsd ymm30{k7}, QWORD PTR [rcx] # AVX512{F,VL} vbroadcastsd ymm30{k7}{z}, QWORD PTR [rcx] # AVX512{F,VL} --- a/gas/testsuite/gas/i386/x86-64-avx512f_vl-intel.d +++ b/gas/testsuite/gas/i386/x86-64-avx512f_vl-intel.d @@ -167,6 +167,17 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 00 08 00 00[ ]*vbroadcasti32x4 ymm30,XMMWORD PTR \[rdx\+0x800\] [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a 72 80[ ]*vbroadcasti32x4 ymm30,XMMWORD PTR \[rdx-0x800\] [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 ymm30,XMMWORD PTR \[rdx-0x810\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 31[ ]*vmovddup xmm30,QWORD PTR \[rcx\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 0f 12 31[ ]*vmovddup xmm30\{k7\},QWORD PTR \[rcx\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 8f 12 31[ ]*vmovddup xmm30\{k7\}\{z\},QWORD PTR \[rcx\] +[ ]*[a-f0-9]+:[ ]*62 21 ff 08 12 b4 f0 23 01 00 00[ ]*vmovddup xmm30,QWORD PTR \[rax\+r14\*8\+0x123\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 7f[ ]*vmovddup xmm30,QWORD PTR \[rdx\+0x3f8\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 00 04 00 00[ ]*vmovddup xmm30,QWORD PTR \[rdx\+0x400\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 80[ ]*vmovddup xmm30,QWORD PTR \[rdx-0x400\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 f8 fb ff ff[ ]*vmovddup xmm30,QWORD PTR \[rdx-0x408\] +[ ]*[a-f0-9]+:[ ]*62 01 ff 08 12 f5[ ]*vmovddup xmm30,xmm29 +[ ]*[a-f0-9]+:[ ]*62 01 ff 0f 12 f5[ ]*vmovddup xmm30\{k7\},xmm29 +[ ]*[a-f0-9]+:[ ]*62 01 ff 8f 12 f5[ ]*vmovddup xmm30\{k7\}\{z\},xmm29 [ ]*[a-f0-9]+:[ ]*62 62 fd 28 19 31[ ]*vbroadcastsd ymm30,QWORD PTR \[rcx\] [ ]*[a-f0-9]+:[ ]*62 62 fd 2f 19 31[ ]*vbroadcastsd ymm30\{k7\},QWORD PTR \[rcx\] [ ]*[a-f0-9]+:[ ]*62 62 fd af 19 31[ ]*vbroadcastsd ymm30\{k7\}\{z\},QWORD PTR \[rcx\] @@ -6474,6 +6485,17 @@ Disassembly of section \.text: [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 00 08 00 00[ ]*vbroadcasti32x4 ymm30,XMMWORD PTR \[rdx\+0x800\] [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a 72 80[ ]*vbroadcasti32x4 ymm30,XMMWORD PTR \[rdx-0x800\] [ ]*[a-f0-9]+:[ ]*62 62 7d 28 5a b2 f0 f7 ff ff[ ]*vbroadcasti32x4 ymm30,XMMWORD PTR \[rdx-0x810\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 31[ ]*vmovddup xmm30,QWORD PTR \[rcx\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 0f 12 31[ ]*vmovddup xmm30\{k7\},QWORD PTR \[rcx\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 8f 12 31[ ]*vmovddup xmm30\{k7\}\{z\},QWORD PTR \[rcx\] +[ ]*[a-f0-9]+:[ ]*62 21 ff 08 12 b4 f0 34 12 00 00[ ]*vmovddup xmm30,QWORD PTR \[rax\+r14\*8\+0x1234\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 7f[ ]*vmovddup xmm30,QWORD PTR \[rdx\+0x3f8\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 00 04 00 00[ ]*vmovddup xmm30,QWORD PTR \[rdx\+0x400\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 72 80[ ]*vmovddup xmm30,QWORD PTR \[rdx-0x400\] +[ ]*[a-f0-9]+:[ ]*62 61 ff 08 12 b2 f8 fb ff ff[ ]*vmovddup xmm30,QWORD PTR \[rdx-0x408\] +[ ]*[a-f0-9]+:[ ]*62 01 ff 08 12 f5[ ]*vmovddup xmm30,xmm29 +[ ]*[a-f0-9]+:[ ]*62 01 ff 0f 12 f5[ ]*vmovddup xmm30\{k7\},xmm29 +[ ]*[a-f0-9]+:[ ]*62 01 ff 8f 12 f5[ ]*vmovddup xmm30\{k7\}\{z\},xmm29 [ ]*[a-f0-9]+:[ ]*62 62 fd 28 19 31[ ]*vbroadcastsd ymm30,QWORD PTR \[rcx\] [ ]*[a-f0-9]+:[ ]*62 62 fd 2f 19 31[ ]*vbroadcastsd ymm30\{k7\},QWORD PTR \[rcx\] [ ]*[a-f0-9]+:[ ]*62 62 fd af 19 31[ ]*vbroadcastsd ymm30\{k7\}\{z\},QWORD PTR \[rcx\] --- a/opcodes/i386-opc.tbl +++ b/opcodes/i386-opc.tbl @@ -1495,6 +1495,8 @@ vblendp, 0x660c | , AVX, Mod vblendvp, 0x664a | , AVX, Modrm|Vex|Space0F3A|VexVVVV|VexW0|CheckOperandSize|NoSuf, { RegXMM|RegYMM, Unspecified|BaseIndex|RegXMM|RegYMM, RegXMM|RegYMM, RegXMM|RegYMM } vbroadcastf128, 0x661a, AVX, Modrm|Vex=2|Space0F38|VexW=1|NoSuf, { Xmmword|Unspecified|BaseIndex, RegYMM } vbroadcastsd, 0x6619, AVX, Modrm|Vex256|Space0F38|VexW0|NoSuf, { Qword|Unspecified|BaseIndex, RegYMM } +// As an extension, provide a 128-bit form as well, utilizing vmovddup. +vbroadcastsd, 0xf212, AVX, Modrm|Vex128|Space0F|VexW0|NoSuf, { Qword|Unspecified|BaseIndex, RegXMM } vbroadcastss, 0x6618, AVX, Modrm|Vex128|Space0F38|VexW0|NoSuf, { Dword|Unspecified|BaseIndex, RegXMM|RegYMM } vcmpp, 0xc2/0x, AVX, Modrm||Vex|Space0F|VexVVVV|VexWIG|CheckOperandSize|NoSuf|ImmExt, { RegXMM|RegYMM|Unspecified|BaseIndex, RegXMM|RegYMM, RegXMM|RegYMM } vcmps, 0xc2/0x, AVX, Modrm||VexLIG|Space0F|VexVVVV|VexWIG|NoSuf|ImmExt, { RegXMM||Unspecified|BaseIndex, RegXMM, RegXMM } @@ -1731,6 +1733,8 @@ vpmovzxwq, 0x6634, AVX2, Modrm|Vex=2|Spa vbroadcasti128, 0x665A, AVX2, Modrm|Vex=2|Space0F38|VexW=1|NoSuf, { Xmmword|Unspecified|BaseIndex, RegYMM } vbroadcastsd, 0x6619, AVX2, Modrm|Vex=2|Space0F38|VexW=1|NoSuf, { RegXMM, RegYMM } +// As an extension, provide a 128-bit form as well, utilizing vmovddup. +vbroadcastsd, 0xf212, AVX2, Modrm|Vex128|Space0F|VexW0|NoSuf, { RegXMM, RegXMM } vbroadcastss, 0x6618, AVX2, Modrm|Vex|Space0F38|VexW=1|NoSuf, { RegXMM, RegXMM|RegYMM } vpblendd, 0x6602, AVX2, Modrm|Vex|Space0F3A|VexVVVV|VexW0|CheckOperandSize|NoSuf, { Imm8|Imm8S, Unspecified|BaseIndex|RegXMM|RegYMM, RegXMM|RegYMM, RegXMM|RegYMM } vpbroadcast, 0x6678 | , AVX2, Modrm|Vex|Space0F38|VexW0|NoSuf, { |Unspecified|BaseIndex|RegXMM, RegXMM|RegYMM } @@ -2128,6 +2132,8 @@ vbroadcasti64x4, 0x665B, AVX512F, Modrm| vbroadcastss, 0x6618, AVX512F, Modrm|Masking|Space0F38|VexW0|Disp8MemShift=2|NoSuf, { RegXMM|Dword|Unspecified|BaseIndex, RegXMM|RegYMM|RegZMM } vbroadcastsd, 0x6619, AVX512F, Modrm|Masking|Space0F38|VexW1|Disp8MemShift=3|NoSuf, { RegXMM|Qword|Unspecified|BaseIndex, RegYMM|RegZMM } +// As an extension, provide a 128-bit form as well, utilizing vmovddup. +vbroadcastsd, 0xf212, AVX512F|AVX512VL, Modrm|EVex128|Masking|Space0F|VexW1|Disp8MemShift=3|NoSuf, { RegXMM|Qword|Unspecified|BaseIndex, RegXMM } vpbroadcast, 0x6658 | , AVX512F, Modrm|Masking|Space0F38||Disp8MemShift|NoSuf, { RegXMM||Unspecified|BaseIndex, RegXMM|RegYMM|RegZMM } vpbroadcast, 0x667c, AVX512F, Modrm|Masking|Space0F38||NoSuf, { , RegXMM|RegYMM|RegZMM }