From patchwork Mon Jan 15 09:28:28 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Srinath Parvathaneni X-Patchwork-Id: 188084 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a05:693c:2614:b0:101:6a76:bbe3 with SMTP id mm20csp1593847dyc; Mon, 15 Jan 2024 01:29:11 -0800 (PST) X-Google-Smtp-Source: AGHT+IEP0/UlC+hsCgpsUYLKBfOyfiL/aq/aobNVaK0Dj+R9PRtnbyn0EY/lKl+64kb2VMc5MFAA X-Received: by 2002:a05:6214:f01:b0:681:1b46:4885 with SMTP id gw1-20020a0562140f0100b006811b464885mr7747424qvb.34.1705310950388; Mon, 15 Jan 2024 01:29:10 -0800 (PST) ARC-Seal: i=4; a=rsa-sha256; t=1705310950; cv=pass; d=google.com; s=arc-20160816; b=d3oNZhGpJucejn2ghAMgVEe0sfrae2VEtD+4lpw2xXV6k0/+L017e0jsfBeHVD8+Fg Dun6oFpzGlFc1V2zWNz3xcG1ktanIVxi7PLuvVhuLpyislXTJSOQ6O4iLYgBfZ7tWmmI gP9YMjlp1as7x7eqzrlfBAkR50ZCgEcDhfNQnzgDMLB6243nqfp65OvxgjMVpzcG3EA+ mUir6v3I2939a6pOfxu+F+udkpkLHJcbwznpn/a/rFI5ij+yJMxeRuWLfzoG6jyalHe6 jAL+KmYGoWy2QdYIdpO7aV6nh8AZ43njRrCk66PPek8h9lUguDaOIVt7fY6KkAK7C3Ee bE1A== ARC-Message-Signature: i=4; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=errors-to:list-subscribe:list-help:list-post:list-archive :list-unsubscribe:list-id:precedence:original-authentication-results :nodisclaimer:mime-version:subject:from:cc:to:content-language :user-agent:date:message-id:authentication-results-original :dkim-signature:dkim-signature:arc-filter:dmarc-filter:delivered-to; bh=JG8mMXrdIc26Wjd3ve8aRzOBnvXvb2haOvu3lgxTD90=; fh=tMP4h4gIc+f6KYjb6V9uapAAChyUxWyPJocaPGGgdmQ=; b=LQq9hrvi4Ppx8nchIougFnkX3us4zSheD/EuTHMDPxNbpcgPLZfn8MyfuahWhpV9XZ JtStLDbO0ci6LGtrZO97gT6k+kTaP1r7tW2HaywdHvY5jetoa8tPdJxVfF6K6tnkWCr2 sm/Zyy7fqc9If+OXUU7LQ0etYui2WX5L621NMd3A2vEEWg76lXZR3vRTD4X5Q72WCqQD SqzTVb0+aTIROTzNvTOTLqfOE+S69h7OSSNebGpvpXx1gSKK7bRN9wCWcsNwMNk7ilW1 7Apmxp3qbj0P1fds0jbuo8teWvB5K5VsidOnaeeMkBFhs3mGRUTd4STNFx3E00p6BDSD lAIw== ARC-Authentication-Results: i=4; mx.google.com; dkim=pass 
header.i=@armh.onmicrosoft.com header.s=selector2-armh-onmicrosoft-com header.b=uT6aLF3A; dkim=pass header.i=@armh.onmicrosoft.com header.s=selector2-armh-onmicrosoft-com header.b=uT6aLF3A; arc=pass (i=3); spf=pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="binutils-bounces+ouuuleilei=gmail.com@sourceware.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from server2.sourceware.org (server2.sourceware.org. [2620:52:3:1:0:246e:9693:128c]) by mx.google.com with ESMTPS id x1-20020a0ca881000000b0067ef8e49371si7413625qva.297.2024.01.15.01.29.10 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 15 Jan 2024 01:29:10 -0800 (PST) Received-SPF: pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) client-ip=2620:52:3:1:0:246e:9693:128c; Authentication-Results: mx.google.com; dkim=pass header.i=@armh.onmicrosoft.com header.s=selector2-armh-onmicrosoft-com header.b=uT6aLF3A; dkim=pass header.i=@armh.onmicrosoft.com header.s=selector2-armh-onmicrosoft-com header.b=uT6aLF3A; arc=pass (i=3); spf=pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 2620:52:3:1:0:246e:9693:128c as permitted sender) smtp.mailfrom="binutils-bounces+ouuuleilei=gmail.com@sourceware.org"; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 117413858287 for ; Mon, 15 Jan 2024 09:29:10 +0000 (GMT) X-Original-To: binutils@sourceware.org Delivered-To: binutils@sourceware.org Received: from EUR05-DB8-obe.outbound.protection.outlook.com (mail-db8eur05on2048.outbound.protection.outlook.com [40.107.20.48]) by sourceware.org (Postfix) with ESMTPS id 1FBDC385841B for ; Mon, 15 Jan 2024 09:28:54 +0000 (GMT) DMARC-Filter: OpenDMARC 
Filter v1.4.2 sourceware.org 1FBDC385841B Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=arm.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=arm.com ARC-Filter: OpenARC Filter v1.0.0 sourceware.org 1FBDC385841B Authentication-Results: server2.sourceware.org; arc=pass smtp.remote-ip=40.107.20.48 ARC-Seal: i=3; a=rsa-sha256; d=sourceware.org; s=key; t=1705310938; cv=pass; b=YYGpti5fcUOsUz7uht49B8OTUAOMN1dLMKBWC6OpgaWObIRZiLGOc1b+CbGNGDc5hfygJT2K1LN3wMZruMoeZBfAeQ7cOfvqYuVwkcvI7La6CRczC2lqmBOjuH832nm794SK2fk5nOZdxb87dvNOb1MUA2amMJFA31ZOLkI8Xx0= ARC-Message-Signature: i=3; a=rsa-sha256; d=sourceware.org; s=key; t=1705310938; c=relaxed/simple; bh=JG8mMXrdIc26Wjd3ve8aRzOBnvXvb2haOvu3lgxTD90=; h=DKIM-Signature:DKIM-Signature:Message-ID:Date:To:From:Subject: MIME-Version; b=Z7QROvfnSySpQI2hknRgDD5HcS8m/+0abPzXcS09dLeBK/L9i0BI3SxO1Zyl1oyukqI/fzwAGUFPma+19Zh+nbjZCorQ6Q8iIonpj8reWgnKtwM8YAieN7wLYQMpTL2pkQaWux/5o9xTxnbT8d+kLXZDHt/It2wjZ/Wupfvsedo= ARC-Authentication-Results: i=3; server2.sourceware.org ARC-Seal: i=2; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=pass; b=ko9uEyuQLkjmUmYpiaT2sPyjm+yZqKiXn1mqH6lQWihj9ZD3s9zJeUTC3FFZ0AIv2HJ2UIdnuNNST+7ZyH3r7iovQsTUbyokkDBYpHZbDtPT/hx1P4YPu6CJy/8yc3ZpMju6qYwpBg02VgXV2PRxwL8XqG5QVNLFbLAZfD3CXjByq/4rflTk7BSFPa+JXhDSLqPJZqEEbLv5cbWphwrYwkREwafrLYuBu1R2INksIItmWmPQjSiWOQldhaLfWPzMVK1/nE6JaEXIfBurOAB1IAWhZ7hi54x0rOZXg1C0uuJqJQIc6Rq5+UIIp4MpTufK/swPYKB0aaA5+jprHVp/Og== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=JG8mMXrdIc26Wjd3ve8aRzOBnvXvb2haOvu3lgxTD90=; 
b=cp7CBkeKR/tFe8F6QreZa9Qv8SVKThjGBzsUfpO4eVirt9S6Me+MSNiy8m3eqxlmUgC0TNmBPvx1IzL6a1LjUA39t3fXOG8RRZEDfY0/oP9xN1B7O0kuokNvvEaHRHAw9dp4TFjXiTpuxkErhKVY+BecxmYbbpdLiGRIVM8YcW/bNLAzBMsXb7+CIuMj4g7KJiMtxp9B5tkgCa7RTJc6MLDj7YgJGc+QV23kl09o94ZTGJRYAVEd5m00rOl6XE+V4k8bTxc6V1J06HBEEzT3SEE7zd1/80UTmx9pWDo+7mpvSZRDCc2UGXbP8aEJnvCqwFsoJa3wuccfz1RwhwqQpg== ARC-Authentication-Results: i=2; mx.microsoft.com 1; spf=pass (sender ip is 63.35.35.123) smtp.rcpttodomain=sourceware.org smtp.mailfrom=arm.com; dmarc=pass (p=none sp=none pct=100) action=none header.from=arm.com; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com; arc=pass (0 oda=1 ltdi=1 spf=[1,1,smtp.mailfrom=arm.com] dkim=[1,1,header.d=arm.com] dmarc=[1,1,header.from=arm.com]) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=armh.onmicrosoft.com; s=selector2-armh-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=JG8mMXrdIc26Wjd3ve8aRzOBnvXvb2haOvu3lgxTD90=; b=uT6aLF3AZe3L257R17cjiAEl01gKZvqw9fAOv7fJ6w4XFCXHn1fhQO//vJEiTJcrXV72dUHQJs0GbR0SUkaI94GgE+VmNgkzBT5Rqtvl7XhYUKxSn5VVnMq4GZEhVWW4kHHazWcsf6Xy8gUnHvF45EZ8JtcnfScJZyNvKUzKsKM= Received: from AS4P192CA0034.EURP192.PROD.OUTLOOK.COM (2603:10a6:20b:658::13) by VI1PR08MB10101.eurprd08.prod.outlook.com (2603:10a6:800:1ca::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7181.17; Mon, 15 Jan 2024 09:28:50 +0000 Received: from AM1PEPF000252DD.eurprd07.prod.outlook.com (2603:10a6:20b:658:cafe::1e) by AS4P192CA0034.outlook.office365.com (2603:10a6:20b:658::13) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7181.23 via Frontend Transport; Mon, 15 Jan 2024 09:28:50 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;dmarc=pass action=none header.from=arm.com; Received-SPF: Pass 
(protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com; pr=C Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by AM1PEPF000252DD.mail.protection.outlook.com (10.167.16.55) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7202.16 via Frontend Transport; Mon, 15 Jan 2024 09:28:50 +0000 Received: ("Tessian outbound 1076c872ecc6:v228"); Mon, 15 Jan 2024 09:28:50 +0000 X-CheckRecipientChecked: true X-CR-MTA-CID: 82defde1b602a84f X-CR-MTA-TID: 64aa7808 Received: from 6e277e11a626.2 by 64aa7808-outbound-1.mta.getcheckrecipient.com id 00D04283-E309-4AA9-B7B3-FF98D5FBCDB4.1; Mon, 15 Jan 2024 09:28:38 +0000 Received: from EUR04-VI1-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id 6e277e11a626.2 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Mon, 15 Jan 2024 09:28:38 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=bR3I4D1R5mNSkkk4Pqcs87K4pJ/TONbZRSrGF9IQy/V7tXsmrH3lulryEIqo4oOWIXbRImCaP8NWeIkrjwYhJwgmAZ31q28gmU4LhY3bG3nF09mXFdMLukxuJc3l468pD106/BGHMP2kzZCNa1BvMT/rVcI4lykXpWE7qmJyDNkOSMF8hCbovNz0JmyxDKO/oPUd5plQszKLQQ0kH9XZD/zIj980Oal7uwnKj7KU4uqVHRD5vmBfFw+kuFMwmdaEbxqCYJJWZBegRhps8MuxE6AcxlcT62wSOWC163ue4W8YbvPcd6B2SqRYZ0KQV7B72BkoN8S3RSTXuslCqxjGgA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=JG8mMXrdIc26Wjd3ve8aRzOBnvXvb2haOvu3lgxTD90=; 
b=lf4Fksx73ngcForGtYPuNnnf7h1udxqThBZl4MQYtbCxqBpr62R7IcI8Vce44/6tl1YA6sK+Bb3WKYEHo8gCbjetNTQ3rJmnkrdk1ULuhb97tR2wIY+IsfTv3bTDrwWcw+YeRHYagnqR5sjKjKybkl3ntCiqGavHt3xADb7+mwTwdjRVURlzWsZzH3YCZZ8CNAPwA6xlaag1RPo0twv6euPj3KzPQftaQ4lWEL+bmgVQweAAlyLXDJvNvOm1bHWTrcG+PRe/MLPa0bXhaCHbmukMu2/VvDQlIx2GnslYuKevbgFluCEqhAUIMATnoS6wURJsAPXzx3nyaGJMLRqJHA== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=armh.onmicrosoft.com; s=selector2-armh-onmicrosoft-com; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=JG8mMXrdIc26Wjd3ve8aRzOBnvXvb2haOvu3lgxTD90=; b=uT6aLF3AZe3L257R17cjiAEl01gKZvqw9fAOv7fJ6w4XFCXHn1fhQO//vJEiTJcrXV72dUHQJs0GbR0SUkaI94GgE+VmNgkzBT5Rqtvl7XhYUKxSn5VVnMq4GZEhVWW4kHHazWcsf6Xy8gUnHvF45EZ8JtcnfScJZyNvKUzKsKM= Authentication-Results-Original: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=arm.com; Received: from VE1PR08MB4893.eurprd08.prod.outlook.com (2603:10a6:802:aa::13) by AS8PR08MB7992.eurprd08.prod.outlook.com (2603:10a6:20b:571::14) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.7181.17; Mon, 15 Jan 2024 09:28:32 +0000 Received: from VE1PR08MB4893.eurprd08.prod.outlook.com ([fe80::bfa1:3b17:7c9a:5feb]) by VE1PR08MB4893.eurprd08.prod.outlook.com ([fe80::bfa1:3b17:7c9a:5feb%7]) with mapi id 15.20.7181.022; Mon, 15 Jan 2024 09:28:32 +0000 Message-ID: <73155200-f7c2-4226-b4be-4a320ea82044@arm.com> Date: Mon, 15 Jan 2024 09:28:28 +0000 User-Agent: Mozilla Thunderbird Content-Language: en-US To: binutils@sourceware.org Cc: richard.earnshaw@arm.com, nickc@redhat.com From: Srinath Parvathaneni Subject: [PATCH 1/6] [Binutils] aarch64: Add support for FEAT_B16B16 instructions. 
X-ClientProxiedBy: LO2P265CA0180.GBRP265.PROD.OUTLOOK.COM (2603:10a6:600:a::24) To VE1PR08MB4893.eurprd08.prod.outlook.com (2603:10a6:802:aa::13) MIME-Version: 1.0 X-MS-TrafficTypeDiagnostic: VE1PR08MB4893:EE_|AS8PR08MB7992:EE_|AM1PEPF000252DD:EE_|VI1PR08MB10101:EE_ X-MS-Office365-Filtering-Correlation-Id: b13cdc9b-e876-4493-4e25-08dc15ac620e X-LD-Processed: f34e5979-57d9-4aaa-ad4d-b122a662184d,ExtAddr x-checkrecipientrouted: true NoDisclaimer: true X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam-Untrusted: BCL:0; X-Microsoft-Antispam-Message-Info-Original: DkHbP7OBlS9HLTcLTxi9S8qNGXBJgov/uE9fVHLMddS1581aEfsVUVYPJ7doQxqpZbAIbqFvpoTGJhO9zzMaLkxkl3pTTFRnjvrDhjuoDi82aHMz/fKSNmYpdo8A2W4e7O666dHKlDzzM0wLVfc5uMZWYFYr0h6dB2SPZ9rmVEVU7gvMGPAGitpPFtwdczDTfE3Ia6QlZDNo3TWsxBFu9XpXtsDN5ghAk3F9+j3Pl3hU4mR/j8FBTn2uZSTS/VVdYMIFqTmw7Ozk0M+Pyu7NHTkzHgn/x/PuIbYDCI+f/v4UXc5og7PEpZfwkFRi3uBVHmChBh2aTkiZxRwwXCwhyvuRBynAERCvyU9t3glJrz/5Svmazbd7SZ4tBRJbWzI5FsJTPFRrttCZZVO7J1Sb/jgCPie7RVCTx9d/cL0XtRX3McOqAVHiNuse6336b88oXfU2bW//toBqXeoRpGRdDKyMx8IuiPW2bKm1+O/0Mt8KLO4q2qdkDC5R+Q41QL6Op38WGGWURe3QlYSYkap9IZlJH1/ycemP5ednSnl4hT2oHftumRQitvqZ+VUkYNDLrNLwSiCC8hozhCdDjXM3Gu03qClE+sTLDwSyQcDXMoq1fld8vfg8BH298z3LLlsUJWeDreM3lMi7rzy9Rx7pSg== X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:VE1PR08MB4893.eurprd08.prod.outlook.com; PTR:; CAT:NONE; SFS:(13230031)(346002)(396003)(39860400002)(136003)(376002)(366004)(230922051799003)(186009)(451199024)(1800799012)(64100799003)(2616005)(44832011)(26005)(5660300002)(38100700002)(6506007)(6916009)(235185007)(6512007)(66946007)(66556008)(41300700001)(4326008)(316002)(2906002)(66476007)(8676002)(8936002)(6666004)(478600001)(33964004)(86362001)(6486002)(31696002)(36756003)(31686004)(45980500001)(43740500002); DIR:OUT; SFP:1101; X-MS-Exchange-Transport-CrossTenantHeadersStamped: AS8PR08MB7992 Original-Authentication-Results: dkim=none (message not signed) 
header.d=none;dmarc=none action=none header.from=arm.com; X-EOPAttributedMessage: 0 X-MS-Exchange-Transport-CrossTenantHeadersStripped: AM1PEPF000252DD.eurprd07.prod.outlook.com X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id-Prvs: 2f9ef4a2-ac26-44f0-e0e4-08dc15ac5756 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: mE4OtSkieTeSHF8kMjL1wlYmLZhuI8k2aYm611K1I5BPlEHwwo46wA7ucaHpDC3PSpRnMb6MgmlSVt59zUCyyNsEGM3Br8rdY5ogdppOC6QY230wOVtTqWaLCP/NPcRXOEWnY77Cyg4FmO+KwsV4jvC36xJ9zVZQZASqbG4RcfFuNJZy21q7t7Qpe123mlKPXw7C9wpCaD2pgd4NnuqqWnmbxy1biHJuFuVPFS9I1FMwlvW+keNCieW+R7rxtk9uVnsBtgtuXtYXKFyRQz0nL9eH6GktN/1xgq78zRjNnwoXdhJwBgCkzfD5sD4qBWUPKk8WoKeMun+elrDHQl5rxVs+nvQHcCgGouMUdiNoYjjBqt+kquhCEBZBMCp0RzPzl2DuBPLBPg+JEogi26ZYW75kks5/lf6PDC0zC8+svbkv1m/kr36++oovnGtwpu1h7n3p8PKFAgWwY8I8TxwNbEY6LcA/el+DvLNYQirhiYOz/DKuCjF2mu+KWMV7qieKAENtEzVapkartn+/TPV+hzI7Zw8gzhIvFc4zF3qpmoRoJZ0a5oQct+iHOa20qxztfEd4J7KTCEU7wTqV6tFouT9wXFEJDB4SLnUN1X9f5nind3IrA7KjvLgSdC2rleKuIGwkbs60wmwud6BEKWg49b/O5StLxJXesRFt9rdkMSemjz0E/CP0zOnPFihF6UH5RxlYtIkrmENL+1XpYP6hvKvueJZy1vjiIaLkRYOjts9jvvUauC1cr8WlbcmNi6/C X-Forefront-Antispam-Report: CIP:63.35.35.123; CTRY:IE; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:64aa7808-outbound-1.mta.getcheckrecipient.com; PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com; CAT:NONE; SFS:(13230031)(4636009)(346002)(396003)(136003)(39860400002)(376002)(230922051799003)(1800799012)(451199024)(186009)(82310400011)(64100799003)(36840700001)(46966006)(33964004)(70206006)(2616005)(70586007)(26005)(5660300002)(81166007)(107886003)(336012)(82740400003)(235185007)(6916009)(8676002)(41300700001)(6512007)(356005)(44832011)(4326008)(8936002)(36860700001)(2906002)(316002)(6666004)(6506007)(6486002)(86362001)(47076005)(478600001)(31696002)(36756003)(40480700001)(31686004)(43740500002); DIR:OUT; SFP:1101; X-OriginatorOrg: arm.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 15 Jan 2024 09:28:50.2649 (UTC) 
X-MS-Exchange-CrossTenant-Network-Message-Id: b13cdc9b-e876-4493-4e25-08dc15ac620e X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d; Ip=[63.35.35.123]; Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com] X-MS-Exchange-CrossTenant-AuthSource: AM1PEPF000252DD.eurprd07.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: VI1PR08MB10101 X-Spam-Status: No, score=-12.5 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, FORGED_SPF_HELO, GIT_PATCH_0, KAM_DMARC_NONE, KAM_LOTSOFHASH, RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_NONE, TXREP, T_SCC_BODY_TEXT_LINE, UNPARSEABLE_RELAY autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: binutils@sourceware.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Binutils mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: binutils-bounces+ouuuleilei=gmail.com@sourceware.org X-getmail-retrieved-from-mailbox: INBOX X-GMAIL-THRID: 1788148134805800978 X-GMAIL-MSGID: 1788148134805800978

Hi,

This patch adds support for the SVE2.1 and SME2.1 non-widening BFloat16 (FEAT_B16B16) instructions. The predicated, unpredicated and indexed variants of the following instructions are added in this patch: bfadd, bfclamp, bfmax, bfmaxnm, bfmin, bfminnm, bfmla, bfmls, bfmul and bfsub.

Regression tested for the aarch64-none-elf target; no regressions were found.

OK for binutils-master?

Regards,
Srinath.
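For readers checking the expected encodings in bfloat16-1.d by hand, here is a minimal sketch, not part of the patch, of how the opcode/mask pair for the predicated bfadd entry (0x65008000 / 0xffffe000, taken from the opcode table in this patch) relates to the test words. The field positions (Zdn in bits 4:0, Zm in bits 9:5, Pg in bits 12:10) are an assumption inferred from the expected encodings in the test, and the function names are hypothetical.

```python
# Opcode/mask pair for the predicated bfadd entry, as listed in this patch.
BFADD_PRED_OPCODE = 0x65008000
BFADD_PRED_MASK = 0xFFFFE000


def decode_bfadd_pred(word):
    """Return (zdn, pg, zm) if `word` matches predicated bfadd, else None.

    Field positions are assumed from the expected test encodings:
    Zdn = bits 4:0, Zm = bits 9:5, Pg = bits 12:10.
    """
    if word & BFADD_PRED_MASK != BFADD_PRED_OPCODE:
        return None
    zdn = word & 0x1F
    zm = (word >> 5) & 0x1F
    pg = (word >> 10) & 0x7
    return zdn, pg, zm


def format_bfadd_pred(word):
    """Render a matching word in the same syntax the test file uses."""
    fields = decode_bfadd_pred(word)
    if fields is None:
        return None
    zdn, pg, zm = fields
    return f"bfadd z{zdn}.h, p{pg}/m, z{zdn}.h, z{zm}.h"


if __name__ == "__main__":
    # Words taken from the expected output in bfloat16-1.d.
    for word in (0x65008200, 0x65008501, 0x65009C10):
        print(f"{word:08x}  {format_bfadd_pred(word)}")
```

The same mask-then-compare step rejects the other predicated entries (bfmax, bfmin and so on), since they differ from bfadd in bits 18:16, which the mask covers.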
diff --git a/gas/config/tc-aarch64.c b/gas/config/tc-aarch64.c index 7eb732adbb6c85fdf4db7c4b14d0be5fafa370b6..bc40d126632e093b02268fd7474f4cf0c6ddf6d7 100644 --- a/gas/config/tc-aarch64.c +++ b/gas/config/tc-aarch64.c @@ -10335,6 +10335,7 @@ static const struct aarch64_option_cpu_value_table aarch64_features[] = { {"ite", AARCH64_FEATURE (ITE), AARCH64_NO_FEATURES}, {"d128", AARCH64_FEATURE (D128), AARCH64_FEATURE (LSE128)}, + {"b16b16", AARCH64_FEATURE (B16B16), AARCH64_FEATURE (SVE2)}, {NULL, AARCH64_NO_FEATURES, AARCH64_NO_FEATURES}, }; diff --git a/gas/testsuite/gas/aarch64/bfloat16-1.d b/gas/testsuite/gas/aarch64/bfloat16-1.d new file mode 100644 index 0000000000000000000000000000000000000000..f0d436bec585ff2aee2e007d63fc672a11a569b9 --- /dev/null +++ b/gas/testsuite/gas/aarch64/bfloat16-1.d @@ -0,0 +1,106 @@ +#name: Test of SVE2.1 and SME2.1 non-widening BFloat16 instructions. +#as: -march=armv9.4-a+b16b16 +#objdump: -dr + +[^:]+: file format .* + + +[^:]+: + +[^:]+: +.*: 65008200 bfadd z0.h, p0\/m, z0.h, z16.h +.*: 65008501 bfadd z1.h, p1\/m, z1.h, z8.h +.*: 65008882 bfadd z2.h, p2\/m, z2.h, z4.h +.*: 65009044 bfadd z4.h, p4\/m, z4.h, z2.h +.*: 65009828 bfadd z8.h, p6\/m, z8.h, z1.h +.*: 65009c10 bfadd z16.h, p7\/m, z16.h, z0.h +.*: 65068200 bfmax z0.h, p0\/m, z0.h, z16.h +.*: 65068501 bfmax z1.h, p1\/m, z1.h, z8.h +.*: 65068882 bfmax z2.h, p2\/m, z2.h, z4.h +.*: 65069044 bfmax z4.h, p4\/m, z4.h, z2.h +.*: 65069828 bfmax z8.h, p6\/m, z8.h, z1.h +.*: 65069c10 bfmax z16.h, p7\/m, z16.h, z0.h +.*: 65048200 bfmaxnm z0.h, p0\/m, z0.h, z16.h +.*: 65048501 bfmaxnm z1.h, p1\/m, z1.h, z8.h +.*: 65048882 bfmaxnm z2.h, p2\/m, z2.h, z4.h +.*: 65049044 bfmaxnm z4.h, p4\/m, z4.h, z2.h +.*: 65049828 bfmaxnm z8.h, p6\/m, z8.h, z1.h +.*: 65049c10 bfmaxnm z16.h, p7\/m, z16.h, z0.h +.*: 65078200 bfmin z0.h, p0\/m, z0.h, z16.h +.*: 65078501 bfmin z1.h, p1\/m, z1.h, z8.h +.*: 65078882 bfmin z2.h, p2\/m, z2.h, z4.h +.*: 65079044 bfmin z4.h, p4\/m, z4.h, z2.h +.*: 65079828 bfmin 
z8.h, p6\/m, z8.h, z1.h +.*: 65079c10 bfmin z16.h, p7\/m, z16.h, z0.h +.*: 65058200 bfminnm z0.h, p0\/m, z0.h, z16.h +.*: 65058501 bfminnm z1.h, p1\/m, z1.h, z8.h +.*: 65058882 bfminnm z2.h, p2\/m, z2.h, z4.h +.*: 65059044 bfminnm z4.h, p4\/m, z4.h, z2.h +.*: 65059828 bfminnm z8.h, p6\/m, z8.h, z1.h +.*: 65059c10 bfminnm z16.h, p7\/m, z16.h, z0.h +.*: 65100080 bfadd z0.h, z4.h, z16.h +.*: 65080101 bfadd z1.h, z8.h, z8.h +.*: 65040182 bfadd z2.h, z12.h, z4.h +.*: 65020204 bfadd z4.h, z16.h, z2.h +.*: 65010288 bfadd z8.h, z20.h, z1.h +.*: 65000310 bfadd z16.h, z24.h, z0.h +.*: 64302480 bfclamp z0.h, z4.h, z16.h +.*: 64282501 bfclamp z1.h, z8.h, z8.h +.*: 64242582 bfclamp z2.h, z12.h, z4.h +.*: 64222604 bfclamp z4.h, z16.h, z2.h +.*: 64212688 bfclamp z8.h, z20.h, z1.h +.*: 64202710 bfclamp z16.h, z24.h, z0.h +.*: 65300000 bfmla z0.h, p0\/m, z0.h, z16.h +.*: 65280421 bfmla z1.h, p1\/m, z1.h, z8.h +.*: 65240842 bfmla z2.h, p2\/m, z2.h, z4.h +.*: 65221084 bfmla z4.h, p4\/m, z4.h, z2.h +.*: 65211908 bfmla z8.h, p6\/m, z8.h, z1.h +.*: 65201e10 bfmla z16.h, p7\/m, z16.h, z0.h +.*: 643e0a00 bfmla z0.h, z16.h, z6.h\[7\] +.*: 643d0901 bfmla z1.h, z8.h, z5.h\[7\] +.*: 643409c2 bfmla z2.h, z14.h, z4.h\[5\] +.*: 642a0aa4 bfmla z4.h, z21.h, z2.h\[3\] +.*: 64210988 bfmla z8.h, z12.h, z1.h\[1\] +.*: 64200950 bfmla z16.h, z10.h, z0.h\[1\] +.*: 65302000 bfmls z0.h, p0\/m, z0.h, z16.h +.*: 65282421 bfmls z1.h, p1\/m, z1.h, z8.h +.*: 65242842 bfmls z2.h, p2\/m, z2.h, z4.h +.*: 65223084 bfmls z4.h, p4\/m, z4.h, z2.h +.*: 65213908 bfmls z8.h, p6\/m, z8.h, z1.h +.*: 65203e10 bfmls z16.h, p7\/m, z16.h, z0.h +.*: 643e0e00 bfmls z0.h, z16.h, z6.h\[7\] +.*: 643d0d01 bfmls z1.h, z8.h, z5.h\[7\] +.*: 64340dc2 bfmls z2.h, z14.h, z4.h\[5\] +.*: 642a0ea4 bfmls z4.h, z21.h, z2.h\[3\] +.*: 64210d88 bfmls z8.h, z12.h, z1.h\[1\] +.*: 64200d50 bfmls z16.h, z10.h, z0.h\[1\] +.*: 65028200 bfmul z0.h, p0\/m, z0.h, z16.h +.*: 65028501 bfmul z1.h, p1\/m, z1.h, z8.h +.*: 65028882 bfmul z2.h, p2\/m, z2.h, z4.h 
+.*: 65029044 bfmul z4.h, p4\/m, z4.h, z2.h +.*: 65029828 bfmul z8.h, p6\/m, z8.h, z1.h +.*: 65029c10 bfmul z16.h, p7\/m, z16.h, z0.h +.*: 65100880 bfmul z0.h, z4.h, z16.h +.*: 65080901 bfmul z1.h, z8.h, z8.h +.*: 65040982 bfmul z2.h, z12.h, z4.h +.*: 65020a04 bfmul z4.h, z16.h, z2.h +.*: 65010a88 bfmul z8.h, z20.h, z1.h +.*: 65000b10 bfmul z16.h, z24.h, z0.h +.*: 643e2a00 bfmul z0.h, z16.h, z6.h\[7\] +.*: 643d2901 bfmul z1.h, z8.h, z5.h\[7\] +.*: 643429c2 bfmul z2.h, z14.h, z4.h\[5\] +.*: 642a2aa4 bfmul z4.h, z21.h, z2.h\[3\] +.*: 64212988 bfmul z8.h, z12.h, z1.h\[1\] +.*: 64202950 bfmul z16.h, z10.h, z0.h\[1\] +.*: 65018200 bfsub z0.h, p0\/m, z0.h, z16.h +.*: 65018501 bfsub z1.h, p1\/m, z1.h, z8.h +.*: 65018882 bfsub z2.h, p2\/m, z2.h, z4.h +.*: 65019044 bfsub z4.h, p4\/m, z4.h, z2.h +.*: 65019828 bfsub z8.h, p6\/m, z8.h, z1.h +.*: 65019c10 bfsub z16.h, p7\/m, z16.h, z0.h +.*: 65100480 bfsub z0.h, z4.h, z16.h +.*: 65080501 bfsub z1.h, z8.h, z8.h +.*: 65040582 bfsub z2.h, z12.h, z4.h +.*: 65020604 bfsub z4.h, z16.h, z2.h +.*: 65010688 bfsub z8.h, z20.h, z1.h +.*: 65000710 bfsub z16.h, z24.h, z0.h diff --git a/gas/testsuite/gas/aarch64/bfloat16-1.s b/gas/testsuite/gas/aarch64/bfloat16-1.s new file mode 100644 index 0000000000000000000000000000000000000000..5597d9ef01906f7316149cdf0bb69addeb849926 --- /dev/null +++ b/gas/testsuite/gas/aarch64/bfloat16-1.s @@ -0,0 +1,112 @@ +bfadd z0.h, p0/m, z0.h, z16.h +bfadd z1.h, p1/m, z1.h, z8.h +bfadd z2.h, p2/m, z2.h, z4.h +bfadd z4.h, p4/m, z4.h, z2.h +bfadd z8.h, p6/m, z8.h, z1.h +bfadd z16.h, p7/m, z16.h, z0.h + +bfmax z0.h, p0/m, z0.h, z16.h +bfmax z1.h, p1/m, z1.h, z8.h +bfmax z2.h, p2/m, z2.h, z4.h +bfmax z4.h, p4/m, z4.h, z2.h +bfmax z8.h, p6/m, z8.h, z1.h +bfmax z16.h, p7/m, z16.h, z0.h + +bfmaxnm z0.h, p0/m, z0.h, z16.h +bfmaxnm z1.h, p1/m, z1.h, z8.h +bfmaxnm z2.h, p2/m, z2.h, z4.h +bfmaxnm z4.h, p4/m, z4.h, z2.h +bfmaxnm z8.h, p6/m, z8.h, z1.h +bfmaxnm z16.h, p7/m, z16.h, z0.h + +bfmin z0.h, p0/m, z0.h, z16.h +bfmin 
z1.h, p1/m, z1.h, z8.h +bfmin z2.h, p2/m, z2.h, z4.h +bfmin z4.h, p4/m, z4.h, z2.h +bfmin z8.h, p6/m, z8.h, z1.h +bfmin z16.h, p7/m, z16.h, z0.h + +bfminnm z0.h, p0/m, z0.h, z16.h +bfminnm z1.h, p1/m, z1.h, z8.h +bfminnm z2.h, p2/m, z2.h, z4.h +bfminnm z4.h, p4/m, z4.h, z2.h +bfminnm z8.h, p6/m, z8.h, z1.h +bfminnm z16.h, p7/m, z16.h, z0.h + +bfadd z0.h, z4.h, z16.h +bfadd z1.h, z8.h, z8.h +bfadd z2.h, z12.h, z4.h +bfadd z4.h, z16.h, z2.h +bfadd z8.h, z20.h, z1.h +bfadd z16.h, z24.h, z0.h + +bfclamp z0.h, z4.h, z16.h +bfclamp z1.h, z8.h, z8.h +bfclamp z2.h, z12.h, z4.h +bfclamp z4.h, z16.h, z2.h +bfclamp z8.h, z20.h, z1.h +bfclamp z16.h, z24.h, z0.h +bfmla z0.h, p0/m, z0.h, z16.h +bfmla z1.h, p1/m, z1.h, z8.h +bfmla z2.h, p2/m, z2.h, z4.h +bfmla z4.h, p4/m, z4.h, z2.h +bfmla z8.h, p6/m, z8.h, z1.h +bfmla z16.h, p7/m, z16.h, z0.h + +bfmla z0.h, z16.h, z6.h[7] +bfmla z1.h, z8.h, z5.h[6] +bfmla z2.h, z14.h, z4.h[4] +bfmla z4.h, z21.h, z2.h[2] +bfmla z8.h, z12.h, z1.h[1] +bfmla z16.h, z10.h, z0.h[0] + +bfmls z0.h, p0/m, z0.h, z16.h +bfmls z1.h, p1/m, z1.h, z8.h +bfmls z2.h, p2/m, z2.h, z4.h +bfmls z4.h, p4/m, z4.h, z2.h +bfmls z8.h, p6/m, z8.h, z1.h +bfmls z16.h, p7/m, z16.h, z0.h + +bfmls z0.h, z16.h, z6.h[7] +bfmls z1.h, z8.h, z5.h[6] +bfmls z2.h, z14.h, z4.h[4] +bfmls z4.h, z21.h, z2.h[2] +bfmls z8.h, z12.h, z1.h[1] +bfmls z16.h, z10.h, z0.h[0] + +bfmul z0.h, p0/m, z0.h, z16.h +bfmul z1.h, p1/m, z1.h, z8.h +bfmul z2.h, p2/m, z2.h, z4.h +bfmul z4.h, p4/m, z4.h, z2.h +bfmul z8.h, p6/m, z8.h, z1.h +bfmul z16.h, p7/m, z16.h, z0.h + +bfmul z0.h, z4.h, z16.h +bfmul z1.h, z8.h, z8.h +bfmul z2.h, z12.h, z4.h +bfmul z4.h, z16.h, z2.h +bfmul z8.h, z20.h, z1.h +bfmul z16.h, z24.h, z0.h + +bfmul z0.h, z16.h, z6.h[7] +bfmul z1.h, z8.h, z5.h[6] +bfmul z2.h, z14.h, z4.h[4] +bfmul z4.h, z21.h, z2.h[2] +bfmul z8.h, z12.h, z1.h[1] +bfmul z16.h, z10.h, z0.h[0] + +bfsub z0.h, p0/m, z0.h, z16.h +bfsub z1.h, p1/m, z1.h, z8.h +bfsub z2.h, p2/m, z2.h, z4.h +bfsub z4.h, p4/m, z4.h, z2.h 
+bfsub z8.h, p6/m, z8.h, z1.h +bfsub z16.h, p7/m, z16.h, z0.h + +bfsub z0.h, z4.h, z16.h +bfsub z1.h, z8.h, z8.h +bfsub z2.h, z12.h, z4.h +bfsub z4.h, z16.h, z2.h +bfsub z8.h, z20.h, z1.h +bfsub z16.h, z24.h, z0.h + + diff --git a/gas/testsuite/gas/aarch64/bfloat16-bad.d b/gas/testsuite/gas/aarch64/bfloat16-bad.d new file mode 100644 index 0000000000000000000000000000000000000000..10d2b001c1a39851ab020e20997f2774663dc3ba --- /dev/null +++ b/gas/testsuite/gas/aarch64/bfloat16-bad.d @@ -0,0 +1,4 @@ +#name: Negative test of Bfloat16 instructions. +#as: -march=armv9.4-a +#source: bfloat16-1.s +#error_output: bfloat16-bad.l diff --git a/gas/testsuite/gas/aarch64/bfloat16-bad.l b/gas/testsuite/gas/aarch64/bfloat16-bad.l new file mode 100644 index 0000000000000000000000000000000000000000..5a5192b329cd250914c860de5331ef3952ef846b --- /dev/null +++ b/gas/testsuite/gas/aarch64/bfloat16-bad.l @@ -0,0 +1,97 @@ +.*: Assembler messages: +.*: Error: selected processor does not support `bfadd z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfadd z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfadd z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfadd z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfadd z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfadd z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfmax z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfmax z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfmax z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfmax z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfmax z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfmax z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfmaxnm z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support 
`bfmaxnm z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfmaxnm z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfmaxnm z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfmaxnm z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfmaxnm z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfmin z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfmin z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfmin z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfmin z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfmin z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfmin z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfminnm z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfminnm z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfminnm z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfminnm z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfminnm z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfminnm z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfadd z0.h,z4.h,z16.h' +.*: Error: selected processor does not support `bfadd z1.h,z8.h,z8.h' +.*: Error: selected processor does not support `bfadd z2.h,z12.h,z4.h' +.*: Error: selected processor does not support `bfadd z4.h,z16.h,z2.h' +.*: Error: selected processor does not support `bfadd z8.h,z20.h,z1.h' +.*: Error: selected processor does not support `bfadd z16.h,z24.h,z0.h' +.*: Error: selected processor does not support `bfclamp z0.h,z4.h,z16.h' +.*: Error: selected processor does not support `bfclamp z1.h,z8.h,z8.h' +.*: Error: selected processor does not support `bfclamp z2.h,z12.h,z4.h' +.*: Error: selected processor does not support 
`bfclamp z4.h,z16.h,z2.h' +.*: Error: selected processor does not support `bfclamp z8.h,z20.h,z1.h' +.*: Error: selected processor does not support `bfclamp z16.h,z24.h,z0.h' +.*: Error: selected processor does not support `bfmla z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfmla z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfmla z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfmla z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfmla z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfmla z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfmla z0.h,z16.h,z6.h\[7\]' +.*: Error: selected processor does not support `bfmla z1.h,z8.h,z5.h\[6\]' +.*: Error: selected processor does not support `bfmla z2.h,z14.h,z4.h\[4\]' +.*: Error: selected processor does not support `bfmla z4.h,z21.h,z2.h\[2\]' +.*: Error: selected processor does not support `bfmla z8.h,z12.h,z1.h\[1\]' +.*: Error: selected processor does not support `bfmla z16.h,z10.h,z0.h\[0\]' +.*: Error: selected processor does not support `bfmls z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfmls z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfmls z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfmls z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfmls z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfmls z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfmls z0.h,z16.h,z6.h\[7\]' +.*: Error: selected processor does not support `bfmls z1.h,z8.h,z5.h\[6\]' +.*: Error: selected processor does not support `bfmls z2.h,z14.h,z4.h\[4\]' +.*: Error: selected processor does not support `bfmls z4.h,z21.h,z2.h\[2\]' +.*: Error: selected processor does not support `bfmls z8.h,z12.h,z1.h\[1\]' +.*: Error: selected processor does not support 
`bfmls z16.h,z10.h,z0.h\[0\]' +.*: Error: selected processor does not support `bfmul z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfmul z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfmul z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfmul z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfmul z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfmul z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfmul z0.h,z4.h,z16.h' +.*: Error: selected processor does not support `bfmul z1.h,z8.h,z8.h' +.*: Error: selected processor does not support `bfmul z2.h,z12.h,z4.h' +.*: Error: selected processor does not support `bfmul z4.h,z16.h,z2.h' +.*: Error: selected processor does not support `bfmul z8.h,z20.h,z1.h' +.*: Error: selected processor does not support `bfmul z16.h,z24.h,z0.h' +.*: Error: selected processor does not support `bfmul z0.h,z16.h,z6.h\[7\]' +.*: Error: selected processor does not support `bfmul z1.h,z8.h,z5.h\[6\]' +.*: Error: selected processor does not support `bfmul z2.h,z14.h,z4.h\[4\]' +.*: Error: selected processor does not support `bfmul z4.h,z21.h,z2.h\[2\]' +.*: Error: selected processor does not support `bfmul z8.h,z12.h,z1.h\[1\]' +.*: Error: selected processor does not support `bfmul z16.h,z10.h,z0.h\[0\]' +.*: Error: selected processor does not support `bfsub z0.h,p0\/m,z0.h,z16.h' +.*: Error: selected processor does not support `bfsub z1.h,p1\/m,z1.h,z8.h' +.*: Error: selected processor does not support `bfsub z2.h,p2\/m,z2.h,z4.h' +.*: Error: selected processor does not support `bfsub z4.h,p4\/m,z4.h,z2.h' +.*: Error: selected processor does not support `bfsub z8.h,p6\/m,z8.h,z1.h' +.*: Error: selected processor does not support `bfsub z16.h,p7\/m,z16.h,z0.h' +.*: Error: selected processor does not support `bfsub z0.h,z4.h,z16.h' +.*: Error: selected processor does not support `bfsub z1.h,z8.h,z8.h' 
+.*: Error: selected processor does not support `bfsub z2.h,z12.h,z4.h'
+.*: Error: selected processor does not support `bfsub z4.h,z16.h,z2.h'
+.*: Error: selected processor does not support `bfsub z8.h,z20.h,z1.h'
+.*: Error: selected processor does not support `bfsub z16.h,z24.h,z0.h'
diff --git a/include/opcode/aarch64.h b/include/opcode/aarch64.h
index 9d64d7a0ebefa4014f30a46c5be7bda124666327..e2ca92361b46a27f67d315d155eb3a9608176cb7 100644
--- a/include/opcode/aarch64.h
+++ b/include/opcode/aarch64.h
@@ -222,6 +222,8 @@ enum aarch64_feature_bit {
   AARCH64_FEATURE_PMUv3_ICNTR,
   /* Performance Monitors Synchronous-Exception-Based Event Extension. */
   AARCH64_FEATURE_SEBEP,
+  /* SVE2.1 and SME2.1 non-widening BFloat16 instructions. */
+  AARCH64_FEATURE_B16B16,
   AARCH64_NUM_FEATURES
 };
diff --git a/opcodes/aarch64-tbl.h b/opcodes/aarch64-tbl.h
index 0cf195d03216a38e1a9b5e06b80af064e2440b91..a8ccdafd044efd62d11ba1e4c199792f6dd44559 100644
--- a/opcodes/aarch64-tbl.h
+++ b/opcodes/aarch64-tbl.h
@@ -1761,6 +1761,10 @@
 { \
   QLF3(S_S,NIL,S_S), \
 }
+#define OP_SVE_SMSS \
+{ \
+  QLF4(S_H,P_M,S_H,S_H), \
+}
 #define OP_SVE_SUU \
 { \
   QLF3(S_S,NIL,NIL), \
@@ -2608,6 +2612,8 @@ static const aarch64_feature_set aarch64_feature_the =
   AARCH64_FEATURE (THE);
 static const aarch64_feature_set aarch64_feature_d128_the =
   AARCH64_FEATURES (2, D128, THE);
+static const aarch64_feature_set aarch64_feature_b16b16 =
+  AARCH64_FEATURE (B16B16);
 #define CORE &aarch64_feature_v8
 #define FP &aarch64_feature_fp
@@ -2670,6 +2676,7 @@ static const aarch64_feature_set aarch64_feature_d128_the =
 #define D128 &aarch64_feature_d128
 #define THE &aarch64_feature_the
 #define D128_THE &aarch64_feature_d128_the
+#define B16B16 &aarch64_feature_b16b16
 #define CORE_INSN(NAME,OPCODE,MASK,CLASS,OP,OPS,QUALS,FLAGS) \
   { NAME, OPCODE, MASK, CLASS, OP, CORE, OPS, QUALS, FLAGS, 0, 0, NULL }
@@ -2739,6 +2746,12 @@ static const aarch64_feature_set aarch64_feature_d128_the =
 #define SVE2_INSNC(NAME,OPCODE,MASK,CLASS,OP,OPS,QUALS,FLAGS,CONSTRAINTS,TIED) \
   { NAME, OPCODE, MASK, CLASS, OP, SVE2, OPS, QUALS, \
     FLAGS | F_STRICT, CONSTRAINTS, TIED, NULL }
+#define B16B16_INSN(NAME,OPCODE,MASK,CLASS,OP,OPS,QUALS,FLAGS,TIED) \
+  { NAME, OPCODE, MASK, CLASS, OP, B16B16, OPS, QUALS, \
+    FLAGS | F_STRICT, 0, TIED, NULL }
+#define B16B16_INSNC(NAME,OPCODE,MASK,CLASS,OP,OPS,QUALS,FLAGS,CONSTRAINTS,TIED) \
+  { NAME, OPCODE, MASK, CLASS, OP, B16B16, OPS, QUALS, \
+    FLAGS | F_STRICT, CONSTRAINTS, TIED, NULL }
 #define SVE2AES_INSN(NAME,OPCODE,MASK,CLASS,OP,OPS,QUALS,FLAGS,TIED) \
   { NAME, OPCODE, MASK, CLASS, OP, SVE2_AES, OPS, QUALS, \
     FLAGS | F_STRICT, 0, TIED, NULL }
@@ -6258,6 +6271,24 @@ const struct aarch64_opcode aarch64_opcode_table[] =
   D128_THE_INSN("rcwsswppal", 0x59e0a000, 0xffe0fc00, OP3 (Rt, Rs, ADDR_SIMPLE), QL_X2NIL, 0),
   D128_THE_INSN("rcwsswppl", 0x5960a000, 0xffe0fc00, OP3 (Rt, Rs, ADDR_SIMPLE), QL_X2NIL, 0),
+/* BFloat16 SVE Instructions. */
+  B16B16_INSNC("bfadd", 0x65008000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSNC("bfmax", 0x65068000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSNC("bfmaxnm", 0x65048000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSNC("bfmin", 0x65078000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSNC("bfminnm", 0x65058000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSNC("bfmla", 0x65200000, 0xffe0e000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zn, SVE_Zm_16), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSNC("bfmls", 0x65202000, 0xffe0e000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zn, SVE_Zm_16), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSN("bfadd", 0x65000000, 0xffe0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm_16), OP_SVE_HHH, 0, 0),
+  B16B16_INSN("bfclamp", 0x64202400, 0xffe0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm_16), OP_SVE_HHH, 0, 0),
+  B16B16_INSNC("bfmul", 0x65028000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSN("bfmul", 0x65000800, 0xffe0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm_16), OP_SVE_HHH, 0, 0),
+  B16B16_INSNC("bfsub", 0x65018000, 0xffffe000, sve_misc, 0, OP4 (SVE_Zd, SVE_Pg3, SVE_Zd, SVE_Zm_5), OP_SVE_SMSS, 0, C_SCAN_MOVPRFX, 0),
+  B16B16_INSN("bfsub", 0x65000400, 0xffe0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm_16), OP_SVE_HHH, 0, 0),
+  B16B16_INSN("bfmla", 0x64200800, 0xffa0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm3_11_INDEX), OP_SVE_VVV_H, 0, 0),
+  B16B16_INSN("bfmls", 0x64200c00, 0xffa0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm3_11_INDEX), OP_SVE_VVV_H, 0, 0),
+  B16B16_INSN("bfmul", 0x64202800, 0xffa0fc00, sve_misc, 0, OP3 (SVE_Zd, SVE_Zn, SVE_Zm3_11_INDEX), OP_SVE_VVV_H, 0, 0),
   {0, 0, 0, 0, 0, 0, {}, {}, 0, 0, 0, NULL},
 };