From patchwork Tue Oct 11 17:53:32 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Mark Harmstone X-Patchwork-Id: 1929 Return-Path: Delivered-To: ouuuleilei@gmail.com Received: by 2002:a5d:4ac7:0:0:0:0:0 with SMTP id y7csp2226358wrs; Tue, 11 Oct 2022 10:53:57 -0700 (PDT) X-Google-Smtp-Source: AMsMyM7+THDO+IcXtXbPMDWvXJk9zlLXjdyfrqPt/N2AUpRnqVf13GN7M14igSkbK+f5oKe6IZTe X-Received: by 2002:a17:907:a065:b0:78d:c5dd:45cf with SMTP id ia5-20020a170907a06500b0078dc5dd45cfmr7269110ejc.117.1665510837597; Tue, 11 Oct 2022 10:53:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1665510837; cv=none; d=google.com; s=arc-20160816; b=S9BEvfqObX5RbaUqSBTHfqOFB85I9T42w18yRecIjFkXfSizcEiewZ/WEfA/d3+edH /r7LAWCoPkrGn0TGvnVb8DNrdq5NUZE0X9kC+yM/BMDTLOl1Qc7Y8eApLnKb4y+D+4OM rHc54414hjwSGjiY/JNEPq8o+6J/qywZjt0cgfZ+edVFyXRLreuqG8qJPHhSCkAC7+aE tNfOlgkY/j/l0FVeEpMO4D6jH1C9H9tRJWFpS53kCfVR38sE4SEAdbvTIlPSS2Pdd424 OwSaX7jtUxOiVzgX/Lthodh61mFncpKbTloptBcZLCm5paj5kh1xOXNuPN3JgFp72+6n qgbg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=sender:errors-to:cc:list-subscribe:list-help:list-post:list-archive :list-unsubscribe:list-id:precedence:content-transfer-encoding :mime-version:references:in-reply-to:message-id:date:subject:to:from :dkim-signature:dmarc-filter:delivered-to; bh=TV5j9qZADk6aw1AiVqaKO81gLzszcTHDQZQrWcPw/u8=; b=d2nICpEapvf3+pY18NxcJD23SwUEzbFCg2DN+I7xrJluOm7di7RvsGlIFXJolmLM24 ClvsmH5QvPAOi2Bch+ZVKPCPUBUvsaAKI7S/Dpx4rm6I48elMrunH41JvqiBU5AMc3wz 5ZhPCof0Gqs272ct4P0MYjKnYONn2zi6qRoQwjEyhg1gDWX/mL9+ZFqXPkItTMuESsi4 PcssU+Al5GqeDeWmi29MfHjH2yfKXZ+CHJaKx11v1bIeqL7Sv/vPTvtSh/cGHvlfpvJK uq+9oruLZWCAKSHYZvqsPz1apyBeTR/8QlF2B0Gi8lkQJ9mfky8UfduQdHtg3tIg1I/T 0Fgw== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@gmail.com header.s=20210112 header.b=MDu5YSL6; spf=pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="binutils-bounces+ouuuleilei=gmail.com@sourceware.org" Received: from sourceware.org (ip-8-43-85-97.sourceware.org. [8.43.85.97]) by mx.google.com with ESMTPS id sd25-20020a1709076e1900b0078df1c345e4si76723ejc.518.2022.10.11.10.53.57 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 11 Oct 2022 10:53:57 -0700 (PDT) Received-SPF: pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 8.43.85.97 as permitted sender) client-ip=8.43.85.97; Authentication-Results: mx.google.com; dkim=fail header.i=@gmail.com header.s=20210112 header.b=MDu5YSL6; spf=pass (google.com: domain of binutils-bounces+ouuuleilei=gmail.com@sourceware.org designates 8.43.85.97 as permitted sender) smtp.mailfrom="binutils-bounces+ouuuleilei=gmail.com@sourceware.org" Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 11092385AC26 for ; Tue, 11 Oct 2022 17:53:50 +0000 (GMT) X-Original-To: binutils@sourceware.org Delivered-To: binutils@sourceware.org Received: from mail-wr1-x433.google.com (mail-wr1-x433.google.com [IPv6:2a00:1450:4864:20::433]) by sourceware.org (Postfix) with ESMTPS id 65B5B385840F for ; Tue, 11 Oct 2022 17:53:40 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 65B5B385840F Authentication-Results: sourceware.org; dmarc=none (p=none dis=none) header.from=harmstone.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=gmail.com Received: by mail-wr1-x433.google.com with SMTP id bv10so19235673wrb.4 for ; Tue, 11 Oct 2022 10:53:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:sender:from:to:cc:subject:date :message-id:reply-to; bh=TV5j9qZADk6aw1AiVqaKO81gLzszcTHDQZQrWcPw/u8=; b=MDu5YSL6cqEFrwFllgThg5iBShYuLi/4QqP4Jqw3Rfl2GkmJ4IVBRDLM4cjzmuOrWQ 83C5JhufRPrfQ3xbDJ4PMG6Qu/vpKQkR1hjfENfqktaqZhycwQClPfWvTH/YfVTPhs52 Bo///uUYIrUnaPL/Q+kFM1zIKuy7bzqg16lyi2YPTTEB5WFIX/49irFDH+rubofyUQQs h2h/CbOW6sY5+qsBKMCFYDwIKZj56/hfDRyddwAPaAbIqBb/lmr+QES/FverkFxY5n50 IuzG40c6HPesEQ2gf7WjWNTyAkUhIiYf0D0LNoOcdH35hWH65TOrasbEjnbAd7TA30NJ fL5w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:sender:x-gm-message-state:from :to:cc:subject:date:message-id:reply-to; bh=TV5j9qZADk6aw1AiVqaKO81gLzszcTHDQZQrWcPw/u8=; b=rx8mqPS3Ppisdyv65+b4lDcDIPh5QzEXQz6Bp4np3nyVCUR01JUgMt6WfnpPt4Fytl CaFrACZ8XP7Ow+nXL4xrdSwoE+cBzdKKG3zE3GX/AnrefTSWExq0rbD6zaYHjKiRTX++ nX1XuR63O0qgucrvQoCqUTQyFtRmi3sY30vcB3W7Ya5UjEYHWY5gmiBLkV6R0WRxn8Lr PLhg/2h1IYpwjdz3OWPH9hrqItRFR6KeHYdpvVNEdtIsbMSpwWcqcX760qbrOx5oV/Zq PmwoexMC8yJ4EwCF9wKj2W/FRj/q3h4SrE6cRFF3SwujS2clazEk/1OI6wQqXucuQKqe ANRQ== X-Gm-Message-State: ACrzQf0fX4mvi+P4BbNgf1ghaDRCM+TrZUyJOnYx24TiBKUpEitq983e b7SKPUaHb9bfH4P2UtV/IIInQpVczi4= X-Received: by 2002:a05:6000:1b06:b0:22c:da6d:b199 with SMTP id f6-20020a0560001b0600b0022cda6db199mr15808420wrz.584.1665510818590; Tue, 11 Oct 2022 10:53:38 -0700 (PDT) Received: from beren.harmstone.com ([2a02:8010:64ea:0:8eb8:7eff:fe53:9d5f]) by smtp.gmail.com with ESMTPSA id n11-20020a05600c3b8b00b003a540fef440sm21520463wms.1.2022.10.11.10.53.37 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 11 Oct 2022 10:53:38 -0700 (PDT) From: Mark Harmstone To: binutils@sourceware.org, martin@martin.st Subject: [PATCH v4 2/2] ld: Add minimal pdb generation Date: Tue, 11 Oct 2022 18:53:32 +0100 Message-Id: <20221011175332.17156-2-mark@harmstone.com> X-Mailer: git-send-email 2.35.1 In-Reply-To: <20221011175332.17156-1-mark@harmstone.com> References: <20221011175332.17156-1-mark@harmstone.com> MIME-Version: 1.0 X-Spam-Status: No, score=-11.5 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_EF, FREEMAIL_FORGED_FROMDOMAIN, FREEMAIL_FROM, GIT_PATCH_0, HEADER_FROM_DIFFERENT_DOMAINS, KAM_SHORT, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: binutils@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Binutils mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Mark Harmstone Errors-To: binutils-bounces+ouuuleilei=gmail.com@sourceware.org Sender: "Binutils" X-getmail-retrieved-from-mailbox: =?utf-8?q?INBOX?= X-GMAIL-THRID: =?utf-8?q?1746414692114156574?= X-GMAIL-MSGID: =?utf-8?q?1746414692114156574?= --- ld/Makefile.am | 10 +- ld/Makefile.in | 13 +- ld/emultempl/pe.em | 7 + ld/emultempl/pep.em | 7 + ld/pdb.c | 516 +++++++++++++++++++++++++++++++++++++ ld/pdb.h | 111 ++++++++ ld/testsuite/ld-pe/pdb.exp | 267 +++++++++++++++++++ 7 files changed, 922 insertions(+), 9 deletions(-) create mode 100644 ld/pdb.c create mode 100644 ld/pdb.h diff --git a/ld/Makefile.am b/ld/Makefile.am index d31021c13e2..43114372989 100644 --- a/ld/Makefile.am +++ b/ld/Makefile.am @@ -475,12 +475,14 @@ ALL_64_EMUL_EXTRA_OFILES = \ CFILES = ldctor.c ldemul.c ldexp.c ldfile.c ldlang.c \ ldmain.c ldmisc.c ldver.c ldwrite.c lexsup.c \ mri.c ldcref.c pe-dll.c pep-dll.c ldlex-wrapper.c \ - plugin.c ldbuildid.c ldelf.c ldelfgen.c + plugin.c ldbuildid.c ldelf.c ldelfgen.c \ + pdb.c HFILES = ld.h ldctor.h ldemul.h ldexp.h ldfile.h \ ldlang.h ldlex.h ldmain.h ldmisc.h ldver.h \ ldwrite.h mri.h deffile.h pe-dll.h pep-dll.h \ - elf-hints-local.h plugin.h ldbuildid.h ldelf.h ldelfgen.h + elf-hints-local.h plugin.h ldbuildid.h ldelf.h ldelfgen.h \ + pdb.h GENERATED_CFILES = ldgram.c ldlex.c deffilep.c GENERATED_HFILES = ldgram.h ldemul-list.h deffilep.h @@ -493,7 +495,7 @@ OFILES = ldgram.@OBJEXT@ ldlex-wrapper.@OBJEXT@ lexsup.@OBJEXT@ ldlang.@OBJEXT@ mri.@OBJEXT@ ldctor.@OBJEXT@ ldmain.@OBJEXT@ plugin.@OBJEXT@ \ ldwrite.@OBJEXT@ ldexp.@OBJEXT@ ldemul.@OBJEXT@ ldver.@OBJEXT@ ldmisc.@OBJEXT@ \ ldfile.@OBJEXT@ ldcref.@OBJEXT@ ${EMULATION_OFILES} ${EMUL_EXTRA_OFILES} \ - ldbuildid.@OBJEXT@ + ldbuildid.@OBJEXT@ pdb.@OBJEXT@ STAGESTUFF = *.@OBJEXT@ ldscripts/* e*.c @@ -956,7 +958,7 @@ EXTRA_ld_new_SOURCES += pep-dll.c pe-dll.c ldelf.c ldelfgen.c ld_new_SOURCES = ldgram.y ldlex-wrapper.c lexsup.c ldlang.c mri.c ldctor.c ldmain.c \ ldwrite.c ldexp.c ldemul.c ldver.c ldmisc.c ldfile.c ldcref.c plugin.c \ - ldbuildid.c + ldbuildid.c pdb.c ld_new_DEPENDENCIES = $(EMULATION_OFILES) $(EMUL_EXTRA_OFILES) \ $(BFDLIB) $(LIBCTF) $(LIBIBERTY) $(LIBINTL_DEP) $(JANSSON_LIBS) ld_new_LDADD = $(EMULATION_OFILES) $(EMUL_EXTRA_OFILES) $(BFDLIB) $(LIBCTF) $(LIBIBERTY) $(LIBINTL) $(ZLIB) $(JANSSON_LIBS) diff --git a/ld/Makefile.in b/ld/Makefile.in index ee0c98f65b0..7ce4704c66f 100644 --- a/ld/Makefile.in +++ b/ld/Makefile.in @@ -211,7 +211,7 @@ am_ld_new_OBJECTS = ldgram.$(OBJEXT) ldlex-wrapper.$(OBJEXT) \ ldctor.$(OBJEXT) ldmain.$(OBJEXT) ldwrite.$(OBJEXT) \ ldexp.$(OBJEXT) ldemul.$(OBJEXT) ldver.$(OBJEXT) \ ldmisc.$(OBJEXT) ldfile.$(OBJEXT) ldcref.$(OBJEXT) \ - plugin.$(OBJEXT) ldbuildid.$(OBJEXT) + plugin.$(OBJEXT) ldbuildid.$(OBJEXT) pdb.$(OBJEXT) ld_new_OBJECTS = $(am_ld_new_OBJECTS) am__DEPENDENCIES_1 = @ENABLE_LIBCTF_TRUE@am__DEPENDENCIES_2 = ../libctf/libctf.la @@ -970,12 +970,14 @@ ALL_64_EMUL_EXTRA_OFILES = \ CFILES = ldctor.c ldemul.c ldexp.c ldfile.c ldlang.c \ ldmain.c ldmisc.c ldver.c ldwrite.c lexsup.c \ mri.c ldcref.c pe-dll.c pep-dll.c ldlex-wrapper.c \ - plugin.c ldbuildid.c ldelf.c ldelfgen.c + plugin.c ldbuildid.c ldelf.c ldelfgen.c \ + pdb.c HFILES = ld.h ldctor.h ldemul.h ldexp.h ldfile.h \ ldlang.h ldlex.h ldmain.h ldmisc.h ldver.h \ ldwrite.h mri.h deffile.h pe-dll.h pep-dll.h \ - elf-hints-local.h plugin.h ldbuildid.h ldelf.h ldelfgen.h + elf-hints-local.h plugin.h ldbuildid.h ldelf.h ldelfgen.h \ + pdb.h GENERATED_CFILES = ldgram.c ldlex.c deffilep.c GENERATED_HFILES = ldgram.h ldemul-list.h deffilep.h @@ -987,7 +989,7 @@ OFILES = ldgram.@OBJEXT@ ldlex-wrapper.@OBJEXT@ lexsup.@OBJEXT@ ldlang.@OBJEXT@ mri.@OBJEXT@ ldctor.@OBJEXT@ ldmain.@OBJEXT@ plugin.@OBJEXT@ \ ldwrite.@OBJEXT@ ldexp.@OBJEXT@ ldemul.@OBJEXT@ ldver.@OBJEXT@ ldmisc.@OBJEXT@ \ ldfile.@OBJEXT@ ldcref.@OBJEXT@ ${EMULATION_OFILES} ${EMUL_EXTRA_OFILES} \ - ldbuildid.@OBJEXT@ + ldbuildid.@OBJEXT@ pdb.@OBJEXT@ STAGESTUFF = *.@OBJEXT@ ldscripts/* e*.c SRC_POTFILES = $(CFILES) $(HFILES) @@ -1006,7 +1008,7 @@ EXTRA_ld_new_SOURCES = deffilep.y ldlex.l pep-dll.c pe-dll.c ldelf.c \ $(ALL_64_EMULATION_SOURCES) ld_new_SOURCES = ldgram.y ldlex-wrapper.c lexsup.c ldlang.c mri.c ldctor.c ldmain.c \ ldwrite.c ldexp.c ldemul.c ldver.c ldmisc.c ldfile.c ldcref.c plugin.c \ - ldbuildid.c + ldbuildid.c pdb.c ld_new_DEPENDENCIES = $(EMULATION_OFILES) $(EMUL_EXTRA_OFILES) \ $(BFDLIB) $(LIBCTF) $(LIBIBERTY) $(LIBINTL_DEP) $(JANSSON_LIBS) @@ -1569,6 +1571,7 @@ distclean-compile: @AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/libldtestplug4_la-testplug4.Plo@am__quote@ @AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/libldtestplug_la-testplug.Plo@am__quote@ @AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/mri.Po@am__quote@ +@AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/pdb.Po@am__quote@ @AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/pe-dll.Po@am__quote@ @AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/pep-dll.Po@am__quote@ @AMDEP_TRUE@@am__include@ @am__quote@./$(DEPDIR)/plugin.Po@am__quote@ diff --git a/ld/emultempl/pe.em b/ld/emultempl/pe.em index 5df818c746c..b324b629965 100644 --- a/ld/emultempl/pe.em +++ b/ld/emultempl/pe.em @@ -66,6 +66,7 @@ fragment < + +struct public +{ + struct public *next; + uint32_t offset; + uint32_t hash; + unsigned int index; + uint16_t section; + uint32_t address; +}; + +/* Add a new stream to the PDB archive, and return its BFD. */ +static bfd * +add_stream (bfd *pdb, const char *name, uint16_t *stream_num) +{ + bfd *stream; + uint16_t num; + + stream = bfd_create (name ? name : "", pdb); + if (!stream) + return NULL; + + if (!bfd_make_writable (stream)) + { + bfd_close (stream); + return false; + } + + if (!pdb->archive_head) + { + bfd_set_archive_head (pdb, stream); + num = 0; + } + else + { + bfd *b = pdb->archive_head; + + num = 1; + + while (b->archive_next) + { + num++; + b = b->archive_next; + } + + b->archive_next = stream; + } + + if (stream_num) + *stream_num = num; + + return stream; +} + +/* Stream 0 ought to be a copy of the MSF directory from the last + time the PDB file was written. Because we don't do incremental + writes this isn't applicable to us, but we fill it with a dummy + value so as not to confuse radare. */ +static bool +create_old_directory_stream (bfd *pdb) +{ + bfd *stream; + char buf[sizeof (uint32_t)]; + + stream = add_stream (pdb, NULL, NULL); + if (!stream) + return false; + + bfd_putl32 (0, buf); + + return bfd_bwrite (buf, sizeof (uint32_t), stream) == sizeof (uint32_t); +} + +/* Calculate the hash of a given string. */ +static uint32_t +calc_hash (const char *data, size_t len) +{ + uint32_t hash = 0; + + while (len >= 4) + { + hash ^= *(uint32_t *) data; + data += 4; + len -= 4; + } + + if (len >= 2) + { + hash ^= *(uint16_t *) data; + data += 2; + len -= 2; + } + + if (len != 0) + hash ^= *data; + + hash |= 0x20202020; + hash ^= (hash >> 11); + + return hash ^ (hash >> 16); +} + +/* Stream 1 is the PDB info stream - see + https://llvm.org/docs/PDB/PdbStream.html. */ +static bool +populate_info_stream (bfd *pdb, bfd *info_stream, const unsigned char *guid) +{ + bool ret = false; + struct pdb_stream_70 h; + uint32_t num_entries, num_buckets; + uint32_t names_length, stream_num; + char int_buf[sizeof (uint32_t)]; + + struct hash_entry + { + uint32_t offset; + uint32_t value; + }; + + struct hash_entry **buckets = NULL; + + /* Write header. */ + + bfd_putl32 (PDB_STREAM_VERSION_VC70, &h.version); + bfd_putl32 (time (NULL), &h.signature); + bfd_putl32 (1, &h.age); + + bfd_putl32 (bfd_getb32 (guid), h.guid); + bfd_putl16 (bfd_getb16 (&guid[4]), &h.guid[4]); + bfd_putl16 (bfd_getb16 (&guid[6]), &h.guid[6]); + memcpy (&h.guid[8], &guid[8], 8); + + if (bfd_bwrite (&h, sizeof (h), info_stream) != sizeof (h)) + return false; + + /* Write hash list of named streams. This is a "rollover" hash, i.e. + if a bucket is filled an entry gets placed in the next free + slot. */ + + num_entries = 0; + for (bfd *b = pdb->archive_head; b; b = b->archive_next) + { + if (strcmp (b->filename, "")) + num_entries++; + } + + num_buckets = num_entries * 2; + + names_length = 0; + stream_num = 0; + + if (num_buckets > 0) + { + buckets = xmalloc (sizeof (struct hash_entry *) * num_buckets); + memset (buckets, 0, sizeof (struct hash_entry *) * num_buckets); + + for (bfd *b = pdb->archive_head; b; b = b->archive_next) + { + if (strcmp (b->filename, "")) + { + size_t len = strlen (b->filename); + uint32_t hash = (uint16_t) calc_hash (b->filename, len); + uint32_t bucket_num = hash % num_buckets; + + while (buckets[bucket_num]) + { + bucket_num++; + + if (bucket_num == num_buckets) + bucket_num = 0; + } + + buckets[bucket_num] = xmalloc (sizeof (struct hash_entry)); + + buckets[bucket_num]->offset = names_length; + buckets[bucket_num]->value = stream_num; + + names_length += len + 1; + } + + stream_num++; + } + } + + /* Write the strings list - the hash keys are indexes into this. */ + + bfd_putl32 (names_length, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + for (bfd *b = pdb->archive_head; b; b = b->archive_next) + { + if (!strcmp (b->filename, "")) + continue; + + size_t len = strlen (b->filename) + 1; + + if (bfd_bwrite (b->filename, len, info_stream) != len) + goto end; + } + + /* Write the number of entries and buckets. */ + + bfd_putl32 (num_entries, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + bfd_putl32 (num_buckets, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + /* Write the present bitmap. */ + + bfd_putl32 ((num_buckets + 31) / 32, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + for (unsigned int i = 0; i < num_buckets; i += 32) + { + uint32_t v = 0; + + for (unsigned int j = 0; j < 32; j++) + { + if (i + j >= num_buckets) + break; + + if (buckets[i + j]) + v |= 1 << j; + } + + bfd_putl32 (v, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + } + + /* Write the (empty) deleted bitmap. */ + + bfd_putl32 (0, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + /* Write the buckets. */ + + for (unsigned int i = 0; i < num_buckets; i++) + { + if (buckets[i]) + { + bfd_putl32 (buckets[i]->offset, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + bfd_putl32 (buckets[i]->value, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + } + } + + bfd_putl32 (0, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + bfd_putl32 (PDB_STREAM_VERSION_VC140, int_buf); + + if (bfd_bwrite (int_buf, sizeof (uint32_t), info_stream) != + sizeof (uint32_t)) + goto end; + + ret = true; + +end: + for (unsigned int i = 0; i < num_buckets; i++) + { + if (buckets[i]) + free (buckets[i]); + } + + free (buckets); + + return ret; +} + +/* Stream 2 is the type information (TPI) stream, and stream 4 is + the ID information (IPI) stream. They differ only in which records + go in which stream. */ +static bool +create_type_stream (bfd *pdb) +{ + bfd *stream; + struct pdb_tpi_stream_header h; + + stream = add_stream (pdb, NULL, NULL); + if (!stream) + return false; + + bfd_putl32 (TPI_STREAM_VERSION_80, &h.version); + bfd_putl32 (sizeof (h), &h.header_size); + bfd_putl32 (TPI_FIRST_INDEX, &h.type_index_begin); + bfd_putl32 (TPI_FIRST_INDEX, &h.type_index_end); + bfd_putl32 (0, &h.type_record_bytes); + bfd_putl16 (0xffff, &h.hash_stream_index); + bfd_putl16 (0xffff, &h.hash_aux_stream_index); + bfd_putl32 (4, &h.hash_key_size); + bfd_putl32 (0x3ffff, &h.num_hash_buckets); + bfd_putl32 (0, &h.hash_value_buffer_offset); + bfd_putl32 (0, &h.hash_value_buffer_length); + bfd_putl32 (0, &h.index_offset_buffer_offset); + bfd_putl32 (0, &h.index_offset_buffer_length); + bfd_putl32 (0, &h.hash_adj_buffer_offset); + bfd_putl32 (0, &h.hash_adj_buffer_length); + + if (bfd_bwrite (&h, sizeof (h), stream) != sizeof (h)) + return false; + + return true; +} + +/* Return the PE architecture number for the image. */ +static uint16_t +get_arch_number (bfd *abfd) +{ + if (abfd->arch_info->arch != bfd_arch_i386) + return 0; + + if (abfd->arch_info->mach & bfd_mach_x86_64) + return IMAGE_FILE_MACHINE_AMD64; + + return IMAGE_FILE_MACHINE_I386; +} + +/* Stream 4 is the debug information (DBI) stream. */ +static bool +populate_dbi_stream (bfd *stream, bfd *abfd) +{ + struct pdb_dbi_stream_header h; + struct optional_dbg_header opt; + + bfd_putl32 (0xffffffff, &h.version_signature); + bfd_putl32 (DBI_STREAM_VERSION_70, &h.version_header); + bfd_putl32 (1, &h.age); + bfd_putl16 (0xffff, &h.global_stream_index); + bfd_putl16 (0x8e1d, &h.build_number); // MSVC 14.29 + bfd_putl16 (0xffff, &h.public_stream_index); + bfd_putl16 (0, &h.pdb_dll_version); + bfd_putl16 (0xffff, &h.sym_record_stream); + bfd_putl16 (0, &h.pdb_dll_rbld); + bfd_putl32 (0, &h.mod_info_size); + bfd_putl32 (0, &h.section_contribution_size); + bfd_putl32 (0, &h.section_map_size); + bfd_putl32 (0, &h.source_info_size); + bfd_putl32 (0, &h.type_server_map_size); + bfd_putl32 (0, &h.mfc_type_server_index); + bfd_putl32 (sizeof (opt), &h.optional_dbg_header_size); + bfd_putl32 (0, &h.ec_substream_size); + bfd_putl16 (0, &h.flags); + bfd_putl16 (get_arch_number (abfd), &h.machine); + bfd_putl32 (0, &h.padding); + + if (bfd_bwrite (&h, sizeof (h), stream) != sizeof (h)) + return false; + + bfd_putl16 (0xffff, &opt.fpo_stream); + bfd_putl16 (0xffff, &opt.exception_stream); + bfd_putl16 (0xffff, &opt.fixup_stream); + bfd_putl16 (0xffff, &opt.omap_to_src_stream); + bfd_putl16 (0xffff, &opt.omap_from_src_stream); + bfd_putl16 (0xffff, &opt.section_header_stream); + bfd_putl16 (0xffff, &opt.token_map_stream); + bfd_putl16 (0xffff, &opt.xdata_stream); + bfd_putl16 (0xffff, &opt.pdata_stream); + bfd_putl16 (0xffff, &opt.new_fpo_stream); + bfd_putl16 (0xffff, &opt.orig_section_header_stream); + + if (bfd_bwrite (&opt, sizeof (opt), stream) != sizeof (opt)) + return false; + + return true; +} + +/* Create a PDB debugging file for the PE image file abfd with the build ID + guid, stored at pdb_name. */ +bool +create_pdb_file (bfd *abfd, const char *pdb_name, const unsigned char *guid) +{ + bfd *pdb; + bool ret = false; + bfd *info_stream, *dbi_stream, *names_stream; + + pdb = bfd_openw (pdb_name, "pdb"); + if (!pdb) + { + einfo (_("%P: warning: cannot create PDB file: %s\n"), + bfd_errmsg (bfd_get_error ())); + return false; + } + + bfd_set_format (pdb, bfd_archive); + + if (!create_old_directory_stream (pdb)) + { + einfo (_("%P: warning: cannot create old directory stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + info_stream = add_stream (pdb, NULL, NULL); + + if (!info_stream) + { + einfo (_("%P: warning: cannot create info stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + if (!create_type_stream (pdb)) + { + einfo (_("%P: warning: cannot create TPI stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + dbi_stream = add_stream (pdb, NULL, NULL); + + if (!dbi_stream) + { + einfo (_("%P: warning: cannot create DBI stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + if (!create_type_stream (pdb)) + { + einfo (_("%P: warning: cannot create IPI stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + names_stream = add_stream (pdb, "/names", NULL); + + if (!names_stream) + { + einfo (_("%P: warning: cannot create /names stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + if (!populate_dbi_stream (dbi_stream, abfd)) + { + einfo (_("%P: warning: cannot populate DBI stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + if (!populate_info_stream (pdb, info_stream, guid)) + { + einfo (_("%P: warning: cannot populate info stream " + "in PDB file: %s\n"), bfd_errmsg (bfd_get_error ())); + goto end; + } + + ret = true; + +end: + bfd_close (pdb); + + return ret; +} diff --git a/ld/pdb.h b/ld/pdb.h new file mode 100644 index 00000000000..e5f53b44f39 --- /dev/null +++ b/ld/pdb.h @@ -0,0 +1,111 @@ +/* pdb.h - header file for generating PDB CodeView debugging files. + Copyright (C) 2022 Free Software Foundation, Inc. + + This file is part of the GNU Binutils. + + This program is free software; you can redistribute it and/or modify + it under the terms of the GNU General Public License as published by + the Free Software Foundation; either version 3 of the License, or + (at your option) any later version. + + This program is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the + GNU General Public License for more details. + + You should have received a copy of the GNU General Public License + along with this program; if not, write to the Free Software + Foundation, Inc., 51 Franklin Street - Fifth Floor, Boston, + MA 02110-1301, USA. */ + +/* Header files referred to below can be found in Microsoft's PDB + repository: https://github.com/microsoft/microsoft-pdb. */ + +#ifndef PDB_H +#define PDB_H + +#include "sysdep.h" +#include "bfd.h" +#include + +/* PDBStream70 in pdb1.h */ +struct pdb_stream_70 +{ + uint32_t version; + uint32_t signature; + uint32_t age; + uint8_t guid[16]; +}; + +#define PDB_STREAM_VERSION_VC70 20000404 +#define PDB_STREAM_VERSION_VC140 20140508 + +/* HDR in tpi.h */ +struct pdb_tpi_stream_header +{ + uint32_t version; + uint32_t header_size; + uint32_t type_index_begin; + uint32_t type_index_end; + uint32_t type_record_bytes; + uint16_t hash_stream_index; + uint16_t hash_aux_stream_index; + uint32_t hash_key_size; + uint32_t num_hash_buckets; + uint32_t hash_value_buffer_offset; + uint32_t hash_value_buffer_length; + uint32_t index_offset_buffer_offset; + uint32_t index_offset_buffer_length; + uint32_t hash_adj_buffer_offset; + uint32_t hash_adj_buffer_length; +}; + +#define TPI_STREAM_VERSION_80 20040203 + +#define TPI_FIRST_INDEX 0x1000 + +/* NewDBIHdr in dbi.h */ +struct pdb_dbi_stream_header +{ + uint32_t version_signature; + uint32_t version_header; + uint32_t age; + uint16_t global_stream_index; + uint16_t build_number; + uint16_t public_stream_index; + uint16_t pdb_dll_version; + uint16_t sym_record_stream; + uint16_t pdb_dll_rbld; + uint32_t mod_info_size; + uint32_t section_contribution_size; + uint32_t section_map_size; + uint32_t source_info_size; + uint32_t type_server_map_size; + uint32_t mfc_type_server_index; + uint32_t optional_dbg_header_size; + uint32_t ec_substream_size; + uint16_t flags; + uint16_t machine; + uint32_t padding; +}; + +#define DBI_STREAM_VERSION_70 19990903 + +struct optional_dbg_header +{ + uint16_t fpo_stream; + uint16_t exception_stream; + uint16_t fixup_stream; + uint16_t omap_to_src_stream; + uint16_t omap_from_src_stream; + uint16_t section_header_stream; + uint16_t token_map_stream; + uint16_t xdata_stream; + uint16_t pdata_stream; + uint16_t new_fpo_stream; + uint16_t orig_section_header_stream; +}; + +extern bool create_pdb_file (bfd *, const char *, const unsigned char *); + +#endif diff --git a/ld/testsuite/ld-pe/pdb.exp b/ld/testsuite/ld-pe/pdb.exp index 1560241cdb8..b62ce6da6f8 100644 --- a/ld/testsuite/ld-pe/pdb.exp +++ b/ld/testsuite/ld-pe/pdb.exp @@ -35,6 +35,249 @@ proc get_pdb_name { pe } { return $pdb } +proc get_pdb_guid { pe } { + global OBJDUMP + + set exec_output [run_host_cmd "$OBJDUMP" "-p $pe"] + + if ![regexp -line "^\\(format RSDS signature (\[0-9a-fA-F\]{32}) age 1 pdb (.*)\\)$" $exec_output full sig pdb] { + return "" + } + + return $sig +} + +proc check_pdb_info_stream { pdb guid } { + global ar + + set exec_output [run_host_cmd "$ar" "x --output tmpdir $pdb 0001"] + + if ![string match "" $exec_output] { + return 0 + } + + set fi [open tmpdir/0001] + fconfigure $fi -translation binary + + # check version + + set data [read $fi 4] + binary scan $data i version + + if { $version != 20000404 } { + close $fi + return 0 + } + + # skip signature (timestamp) + read $fi 4 + + # check age + + set data [read $fi 4] + binary scan $data i age + + if { $age != 1 } { + close $fi + return 0 + } + + # check GUID + + set data [read $fi 16] + binary scan $data H2H2H2H2H2H2H2H2H* guid1 guid2 guid3 guid4 guid5 guid6 guid7 guid8 guid9 + + set data "$guid4$guid3$guid2$guid1$guid6$guid5$guid8$guid7$guid9" + + if { $data ne $guid } { + close $fi + return 0 + } + + # skip names string + + set data [read $fi 4] + binary scan $data i names_length + read $fi $names_length + + # read number of names entries + + set data [read $fi 4] + binary scan $data i num_entries + + # skip number of buckets + read $fi 4 + + # skip present bitmap + + set data [read $fi 4] + binary scan $data i bitmap_length + read $fi [expr $bitmap_length * 4] + + # skip deleted bitmap + + set data [read $fi 4] + binary scan $data i bitmap_length + read $fi [expr $bitmap_length * 4] + + # skip names entries + read $fi [expr $num_entries * 8] + + # skip uint32_t + read $fi 4 + + # read second version + + set data [read $fi 4] + binary scan $data i version2 + + if { $version2 != 20140508 } { + close $fi + return 0 + } + + close $fi + + return 1 +} + +proc check_type_stream { pdb stream } { + global ar + + set exec_output [run_host_cmd "$ar" "x --output tmpdir $pdb $stream"] + + if ![string match "" $exec_output] { + return 0 + } + + set fi [open tmpdir/$stream] + fconfigure $fi -translation binary + + # check version + + set data [read $fi 4] + binary scan $data i version + + if { $version != 20040203 } { + close $fi + return 0 + } + + # check header size + + set data [read $fi 4] + binary scan $data i header_size + + if { $header_size != 0x38 } { + close $fi + return 0 + } + + # skip type_index_begin and type_index_end + read $fi 8 + + # read type_record_bytes + + set data [read $fi 4] + binary scan $data i type_record_bytes + + close $fi + + # check stream length + + set stream_length [file size tmpdir/$stream] + + if { $stream_length != [ expr $header_size + $type_record_bytes ] } { + return 0 + } + + return 1 +} + +proc check_dbi_stream { pdb } { + global ar + + set exec_output [run_host_cmd "$ar" "x --output tmpdir $pdb 0003"] + + if ![string match "" $exec_output] { + return 0 + } + + set fi [open tmpdir/0003] + fconfigure $fi -translation binary + + # check signature + + set data [read $fi 4] + binary scan $data i signature + + if { $signature != -1 } { + close $fi + return 0 + } + + # check version + + set data [read $fi 4] + binary scan $data i version + + if { $version != 19990903 } { + close $fi + return 0 + } + + # check age + + set data [read $fi 4] + binary scan $data i age + + if { $age != 1 } { + close $fi + return 0 + } + + # skip fields + read $fi 12 + + # read substream sizes + + set data [read $fi 4] + binary scan $data i mod_info_size + + set data [read $fi 4] + binary scan $data i section_contribution_size + + set data [read $fi 4] + binary scan $data i section_map_size + + set data [read $fi 4] + binary scan $data i source_info_size + + set data [read $fi 4] + binary scan $data i type_server_map_size + + set data [read $fi 4] + binary scan $data i mfc_type_server_index + + set data [read $fi 4] + binary scan $data i optional_dbg_header_size + + set data [read $fi 4] + binary scan $data i ec_substream_size + + close $fi + + # check stream length + + set stream_length [file size tmpdir/0003] + + if { $stream_length != [expr 0x40 + $mod_info_size + $section_contribution_size + $section_map_size + $source_info_size + $type_server_map_size + $mfc_type_server_index + $optional_dbg_header_size + $ec_substream_size] } { + return 0 + } + + return 1 +} + if ![ld_assemble $as $srcdir/$subdir/pdb1.s tmpdir/pdb1.o] { unsupported "Build pdb1.o" return @@ -51,3 +294,27 @@ if ![string equal [get_pdb_name "tmpdir/pdb1.exe"] "pdb1.pdb"] { } pass "PDB filename present in CodeView debug info" + +if [check_pdb_info_stream tmpdir/pdb1.pdb [get_pdb_guid "tmpdir/pdb1.exe"]] { + pass "Valid PDB info stream" +} else { + fail "Invalid PDB info stream" +} + +if [check_type_stream tmpdir/pdb1.pdb "0002"] { + pass "Valid TPI stream" +} else { + fail "Invalid TPI stream" +} + +if [check_type_stream tmpdir/pdb1.pdb "0004"] { + pass "Valid IPI stream" +} else { + fail "Invalid IPI stream" +} + +if [check_dbi_stream tmpdir/pdb1.pdb] { + pass "Valid DBI stream" +} else { + fail "Invalid DBI stream" +}