[RFC] - Deduplication of debug information in linkers (LLD)

JonChesterfield · December 4, 2017, 11:58pm

At least one proprietary linker put a lot of effort into deduplicating and rewriting debug information. This took up the majority of the link time, despite serious engineering effort spent on performance optimisation. For example, some sections were written from scratch by the linker because that proved faster than parsing the input. Teaching LLD to deduplicate DWARF should be expected to slow it down dramatically when the feature is enabled (and ideally not at all when it is disabled).

Is a more incremental approach viable? In particular, are there IR passes that fold debug strings and the like, which could be run before everything is fed into the linker?

Rui_Ueyama · December 5, 2017, 4:58am

Jon,

I think what George suggested is different from making lld parse, deduplicate and rewrite the DWARF debug info. What he suggested is to make the compiler emit multiple debug sections so that the linker can eliminate the duplicates, just as it already does for, say, inline functions. The elimination is done by (essentially) section name, so it should be quite fast. Parsing all debug info and reconstructing it is completely different IMO.
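
To make the distinction concrete, here is a rough Python sketch of the "eliminate by key" idea. It is not lld code: the `Section` record, the `group_key` field and `dedup_by_group_key` are all hypothetical names invented for illustration, and the model assumes the compiler has already tagged duplicable debug sections with a COMDAT-style key. The point is only that the linker compares keys and never looks inside the DWARF.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Section:
    name: str                  # e.g. ".debug_info"
    group_key: Optional[str]   # hypothetical COMDAT-style key; None means "not grouped"
    origin: str                # object file the section came from
    size: int                  # size in bytes (contents are never inspected)


def dedup_by_group_key(sections: List[Section]) -> List[Section]:
    """Keep every section of the first object file that defines a group key.

    Sections carrying the same key from any later object file are dropped.
    Ungrouped sections are always kept. No section contents are parsed.
    """
    winner: Dict[str, str] = {}   # group key -> object file that defined it first
    kept: List[Section] = []
    for sec in sections:
        if sec.group_key is None:
            kept.append(sec)
            continue
        # setdefault records the first definer and returns the recorded one.
        if winner.setdefault(sec.group_key, sec.origin) == sec.origin:
            kept.append(sec)
    return kept


# Two object files both carry debug sections for the same inline function.
inputs = [
    Section(".debug_info", "inline_foo", "a.o", 120),
    Section(".debug_line", "inline_foo", "a.o", 40),
    Section(".debug_info", "inline_foo", "b.o", 120),
    Section(".debug_line", "inline_foo", "b.o", 40),
    Section(".debug_info", None, "b.o", 300),
]

kept = dedup_by_group_key(inputs)
for sec in kept:
    print(sec.origin, sec.name, sec.size)
```

The work here is a single pass with a hash lookup per section, which is why this kind of elimination is expected to be cheap compared with parsing and rewriting the debug info itself.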

echristo · December 5, 2017, 11:18pm

And, IMO, a good time for a post-processing tool similar to dsymutil or dwz.
