From patchwork Tue Oct 17 08:12:13 2023
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Jakub Jelinek
X-Patchwork-Id: 153960
Date: Tue, 17 Oct 2023 10:12:13 +0200
From: Jakub Jelinek
To: Richard Biener, Richard Sandiford
Cc: gcc-patches@gcc.gnu.org
Subject: [PATCH] wide-int-print: Don't print large numbers hexadecimally for print_dec{,s,u}
List-Id: Gcc-patches mailing list
Reply-To: Jakub Jelinek

Hi!

The following patch makes print_dec{,s,u} print wide_int/widest_int
numbers in decimal even if they have precision larger than 64 and
get_len () above 1 (right now we print them in hexadecimal, and even
print negative numbers as huge positive hexadecimal values).

In order to avoid doing the expensive division/modulo by 10^19 twice,
once to estimate how many chunks will be needed and once to actually
print them, the patch prints the 19 digit chunks in reverse order (from
least significant to most significant) and then reorders them in linear
time to form the correctly ordered number.

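To illustrate the reverse-order chunk printing described above, here is a
minimal standalone sketch.  It is not the patch's code: it assumes
unsigned __int128 (a GCC/Clang extension) as a stand-in for widest2_int,
and the names u128, reorder_chunks, print_decu_sketch and to_decimal are
made up for the example.  It also reverses the chunks and then shifts them
in two simple passes rather than the single pass used in print_decu; both
are linear in the length of the output.

```cpp
#include <cassert>
#include <cinttypes>
#include <cstddef>
#include <cstdio>
#include <cstring>
#include <string>

// Assumption: a 128-bit stand-in for widest2_int (GCC/Clang extension).
typedef unsigned __int128 u128;

// Decimal digits per chunk; 10^19 is the largest power of 10 below 2^64.
constexpr std::size_t CHUNK = 19;

// BUF holds C chunks of CHUNK digits, least significant chunk first.
// Reverse their order, shift them right by HEAD_LEN bytes and put HEAD
// in front.  BUF must have room for HEAD_LEN + C * CHUNK + 1 bytes.
void
reorder_chunks (char *buf, std::size_t c, const char *head,
                std::size_t head_len)
{
  char tmp[CHUNK];
  for (std::size_t i = 0; i < c / 2; ++i)
    {
      char *lo = buf + i * CHUNK;
      char *hi = buf + (c - i - 1) * CHUNK;
      std::memcpy (tmp, lo, CHUNK);
      std::memcpy (lo, hi, CHUNK);
      std::memcpy (hi, tmp, CHUNK);
    }
  std::memmove (buf + head_len, buf, c * CHUNK);
  std::memcpy (buf, head, head_len);
  buf[head_len + c * CHUNK] = '\0';
}

// Print V in decimal into BUF, which must hold at least 40 bytes.
// Each value is divided by 10^19 only once.
void
print_decu_sketch (u128 v, char *buf)
{
  const u128 ten19 = (u128) 10000000000000000000ULL;
  char head[21];
  std::size_t c = 0;
  while (v >= ten19)
    {
      std::uint64_t r = (std::uint64_t) (v % ten19);
      v /= ten19;
      // Zero-padded chunk; its '\0' is overwritten by the next chunk.
      std::snprintf (buf + c * CHUNK, CHUNK + 1, "%019" PRIu64, r);
      ++c;
    }
  // The most significant part is printed without padding.
  int l = std::snprintf (head, sizeof head, "%" PRIu64, (std::uint64_t) v);
  assert (l > 0);
  reorder_chunks (buf, c, head, (std::size_t) l);
}

// Convenience wrapper for the example.
std::string
to_decimal (u128 v)
{
  char buf[40];
  print_decu_sketch (v, buf);
  return std::string (buf);
}
```

For instance, to_decimal (~(u128) 0) yields
340282366920938463463374607431768211455, i.e. a one digit head followed by
two 19 digit chunks.
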
Tested with printing both 256 and 320 bit numbers (the first as an example
of an even number of 19 digit chunks plus one shorter chunk above them,
the second as an example of an odd number of 19 digit chunks plus one
shorter chunk above them).

Thinking about it now, the l * HOST_BITS_PER_WIDE_INT / 3 + 3 estimate is
one byte too much (only one byte is needed for - and one for '\0') and too
conservative, so we could go with l * HOST_BITS_PER_WIDE_INT / 3 + 2 as
well, or e.g. l * HOST_BITS_PER_WIDE_INT * 10 / 33 + 3 as an even less
conservative estimate (though more expensive to compute in inline code).
But the existing l * HOST_BITS_PER_WIDE_INT / 4 + 4 is likely one byte too
much as well: 2 bytes for 0x, one byte for '\0', and where does the 4th
one come from?  Of course all of these assume HOST_BITS_PER_WIDE_INT is a
multiple of 64...

Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?

2023-10-17  Jakub Jelinek

	* wide-int-print.h (print_dec_buf_size): For length, divide number
	of bits by 3 and add 3 instead of division by 4 and adding 4.
	* wide-int-print.cc (print_decs): Remove superfluous ()s.  Don't
	call print_hex, instead call print_decu on either negated value
	after printing - or on wi itself.
	(print_decu): Don't call print_hex, instead print even large
	numbers decimally.
	(pp_wide_int_large): Assume len from print_dec_buf_size is big
	enough even if it returns false.
	* pretty-print.h (pp_wide_int): Use print_dec_buf_size to check
	if pp_wide_int_large should be used.
	* tree-pretty-print.cc (dump_generic_node): Use print_hex_buf_size
	to compute needed buffer size.

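As a sanity check of the buffer size estimates discussed above, the
following standalone sketch compares them with the exact number of bytes
needed.  It assumes HOST_BITS_PER_WIDE_INT is 64, counts decimal digits
with a tiny bignum, and follows the accounting from the mail for hex
("0x" plus digits plus '\0'); all names in it are made up for the example
and none of it is part of the patch.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Assumption for this sketch: HOST_BITS_PER_WIDE_INT is 64.
constexpr unsigned HWI_BITS = 64;

// Number of decimal digits of 2^BITS, computed exactly by repeated
// doubling of a little-endian vector of decimal digits.
unsigned
digits_of_pow2 (unsigned bits)
{
  std::vector<unsigned char> d (1, 1);  // the value 1
  for (unsigned b = 0; b < bits; ++b)
    {
      unsigned carry = 0;
      for (std::size_t i = 0; i < d.size (); ++i)
        {
          unsigned t = d[i] * 2u + carry;
          d[i] = (unsigned char) (t % 10);
          carry = t / 10;
        }
      if (carry)
        d.push_back ((unsigned char) carry);
    }
  return (unsigned) d.size ();
}

// Bytes needed to print any L-limb value in decimal, including '\0'.
// The largest unsigned value, 2^(64*L) - 1, has as many digits as
// 2^(64*L); the most negative signed value is -2^(64*L - 1) and needs
// one more byte for '-'.
unsigned
dec_bytes_needed (unsigned l)
{
  unsigned uns = digits_of_pow2 (l * HWI_BITS) + 1;
  unsigned sig = digits_of_pow2 (l * HWI_BITS - 1) + 2;
  return std::max (uns, sig);
}

// The three decimal estimates mentioned in the mail.
unsigned est_div3_plus3 (unsigned l) { return l * HWI_BITS / 3 + 3; }
unsigned est_div3_plus2 (unsigned l) { return l * HWI_BITS / 3 + 2; }
unsigned est_10_33_plus3 (unsigned l) { return l * HWI_BITS * 10 / 33 + 3; }

// Hex: "0x", 16 digits per limb, '\0', versus the existing estimate.
unsigned hex_bytes_needed (unsigned l) { return 2 + l * HWI_BITS / 4 + 1; }
unsigned est_hex_div4_plus4 (unsigned l) { return l * HWI_BITS / 4 + 4; }
```

For a single limb this gives 21 bytes needed against estimates of 24, 23
and 22; for two limbs the 10 / 33 variant comes out at exactly the 41
bytes needed, so under these assumptions its + 3 cannot be reduced.
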
	Jakub

--- gcc/wide-int-print.h.jj 2023-10-15 23:04:06.195422820 +0200
+++ gcc/wide-int-print.h 2023-10-16 10:14:41.327401697 +0200
@@ -42,7 +42,7 @@ print_dec_buf_size (const wide_int_ref &
   unsigned int l = wi.get_len ();
   if ((l != 1 || sgn == UNSIGNED) && wi::neg_p (wi))
     l = WIDE_INT_MAX_HWIS (wi.get_precision ());
-  l = l * HOST_BITS_PER_WIDE_INT / 4 + 4;
+  l = l * HOST_BITS_PER_WIDE_INT / 3 + 3;
   *len = l;
   return UNLIKELY (l > WIDE_INT_PRINT_BUFFER_SIZE);
 }
--- gcc/wide-int-print.cc.jj 2023-10-15 23:04:06.195422820 +0200
+++ gcc/wide-int-print.cc 2023-10-16 11:20:30.662174735 +0200
@@ -49,14 +49,12 @@ print_dec (const wide_int_ref &wi, FILE
 }
 
 
-/* Try to print the signed self in decimal to BUF if the number fits
-   in a HWI. Other print in hex. */
+/* Try to print the signed self in decimal to BUF. */
 
 void
 print_decs (const wide_int_ref &wi, char *buf)
 {
-  if ((wi.get_precision () <= HOST_BITS_PER_WIDE_INT)
-      || (wi.get_len () == 1))
+  if (wi.get_precision () <= HOST_BITS_PER_WIDE_INT || wi.get_len () == 1)
     {
       if (wi::neg_p (wi))
         sprintf (buf, "-" HOST_WIDE_INT_PRINT_UNSIGNED,
@@ -64,12 +62,17 @@ print_decs (const wide_int_ref &wi, char
       else
         sprintf (buf, HOST_WIDE_INT_PRINT_DEC, wi.to_shwi ());
     }
+  else if (wi::neg_p (wi))
+    {
+      widest2_int w = widest2_int::from (wi, SIGNED);
+      *buf = '-';
+      print_decu (-w, buf + 1);
+    }
   else
-    print_hex (wi, buf);
+    print_decu (wi, buf);
 }
 
-/* Try to print the signed self in decimal to FILE if the number fits
-   in a HWI. Other print in hex. */
+/* Try to print the signed self in decimal to FILE. */
 
 void
 print_decs (const wide_int_ref &wi, FILE *file)
@@ -82,8 +85,7 @@ print_decs (const wide_int_ref &wi, FILE
   fputs (p, file);
 }
 
-/* Try to print the unsigned self in decimal to BUF if the number fits
-   in a HWI. Other print in hex. */
+/* Try to print the unsigned self in decimal to BUF. */
 
 void
 print_decu (const wide_int_ref &wi, char *buf)
@@ -92,11 +94,37 @@ print_decu (const wide_int_ref &wi, char
       || (wi.get_len () == 1 && !wi::neg_p (wi)))
     sprintf (buf, HOST_WIDE_INT_PRINT_UNSIGNED, wi.to_uhwi ());
   else
-    print_hex (wi, buf);
+    {
+      widest2_int w = widest2_int::from (wi, UNSIGNED), r;
+      widest2_int ten19 = HOST_WIDE_INT_UC (10000000000000000000);
+      char buf2[20], next1[19], next2[19];
+      size_t l, c = 0, i;
+      /* In order to avoid dividing this twice, print the 19 decimal
+         digit chunks in reverse order into buffer and then reorder
+         them in-place. */
+      while (wi::gtu_p (w, ten19))
+        {
+          w = wi::divmod_trunc (w, ten19, UNSIGNED, &r);
+          sprintf (buf + c * 19, "%019" PRIu64, r.to_uhwi ());
+          ++c;
+        }
+      l = sprintf (buf2, HOST_WIDE_INT_PRINT_UNSIGNED, w.to_uhwi ());
+      buf[c * 19 + l] = '\0';
+      memcpy (next1, buf, 19);
+      memcpy (buf, buf2, l);
+      for (i = 0; i < c / 2; ++i)
+        {
+          memcpy (next2, buf + (c - i - 1) * 19, 19);
+          memcpy (buf + l + (c - i - 1) * 19, next1, 19);
+          memcpy (next1, buf + (i + 1) * 19, 19);
+          memcpy (buf + l + i * 19, next2, 19);
+        }
+      if (c & 1)
+        memcpy (buf + l + i * 19, next1, 19);
+    }
 }
 
-/* Try to print the signed self in decimal to FILE if the number fits
-   in a HWI. Other print in hex. */
+/* Try to print the signed self in decimal to FILE. */
 
 void
 print_decu (const wide_int_ref &wi, FILE *file)
@@ -155,8 +183,7 @@ void
 pp_wide_int_large (pretty_printer *pp, const wide_int_ref &w, signop sgn)
 {
   unsigned int len;
-  if (!print_dec_buf_size (w, sgn, &len))
-    len = WIDE_INT_PRINT_BUFFER_SIZE;
+  print_dec_buf_size (w, sgn, &len);
   char *buf = XALLOCAVEC (char, len);
   print_dec (w, buf, sgn);
   pp_string (pp, buf);
--- gcc/pretty-print.h.jj 2023-10-15 23:04:06.095422965 +0200
+++ gcc/pretty-print.h 2023-10-16 10:51:56.053529117 +0200
@@ -448,8 +448,9 @@ pp_wide_integer (pretty_printer *pp, HOS
 inline void
 pp_wide_int (pretty_printer *pp, const wide_int_ref &w, signop sgn)
 {
-  unsigned int prec = w.get_precision ();
-  if (UNLIKELY ((prec + 3) / 4 > sizeof (pp_buffer (pp)->digit_buffer) - 3))
+  unsigned int len;
+  print_dec_buf_size (w, sgn, &len);
+  if (UNLIKELY (len > sizeof (pp_buffer (pp)->digit_buffer)))
     pp_wide_int_large (pp, w, sgn);
   else
     {
--- gcc/tree-pretty-print.cc.jj 2023-09-21 20:02:53.467522151 +0200
+++ gcc/tree-pretty-print.cc 2023-10-16 11:05:51.131997367 +0200
@@ -2248,10 +2248,11 @@ dump_generic_node (pretty_printer *pp, t
                   pp_minus (pp);
                   val = -val;
                 }
-              unsigned int prec = val.get_precision ();
-              if ((prec + 3) / 4 > sizeof (pp_buffer (pp)->digit_buffer) - 3)
+              unsigned int len;
+              print_hex_buf_size (val, &len);
+              if (UNLIKELY (len > sizeof (pp_buffer (pp)->digit_buffer)))
                 {
-                  char *buf = XALLOCAVEC (char, (prec + 3) / 4 + 3);
+                  char *buf = XALLOCAVEC (char, len);
                   print_hex (val, buf);
                   pp_string (pp, buf);
                 }