[RFC] LLD: Add support for GCC LTO format

Tulio_Magno_Quites_M · July 2, 2025, 1:47pm

Introduction

GCC provides a linker plugin via liblto_plugin.so that exposes an interface for linkers to support the GCC LTO format.

Adding support for this format to LLD will allow users to link object files generated with GCC LTO.

Description

This is not feature complete yet, but this prototype is able to link simple executables.
I believe it’s a good time to start getting early feedback from the community in order to guide me while I implement the missing features.

The code is available in my Github repository.

It’s split into 3 commits:

Add support for -plugin and update tests;
Split the LTO code and start adding an initial implementation for GCC;
Add the remaining implementation for linking a simple executable;

During this work, I decided to not add support for option -pass-through=. While GCC may use it, ld.bfd completely ignores it. Example of GCC usage:
-plugin-opt=-pass-through=-lgcc -plugin-opt=-pass-through=-lgcc_s -plugin-opt=-pass-through=-lc -plugin-opt=-pass-through=-lgcc -plugin-opt=-pass-through=-lgcc_s.
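To illustrate the decision above, here is a minimal C++ sketch of how a linker driver could separate `-plugin-opt=` values and set aside the `-pass-through=` ones. The names `PluginOpts` and `splitPluginOpts` are hypothetical and are not taken from the prototype; where exactly the prototype drops these options is an assumption.

```cpp
#include <string>
#include <vector>

// Hypothetical helper, not part of the prototype: the values of -plugin-opt=
// arguments, split into those kept for the plugin and those set aside.
struct PluginOpts {
  std::vector<std::string> forwarded; // would be handed to the plugin
  std::vector<std::string> ignored;   // -pass-through= values, not acted on
};

PluginOpts splitPluginOpts(const std::vector<std::string> &args) {
  const std::string prefix = "-plugin-opt=";
  const std::string passThrough = "-pass-through=";
  PluginOpts out;
  for (const std::string &arg : args) {
    // Skip anything that is not a -plugin-opt= argument.
    if (arg.compare(0, prefix.size(), prefix) != 0)
      continue;
    std::string value = arg.substr(prefix.size());
    if (value.compare(0, passThrough.size(), passThrough) == 0)
      out.ignored.push_back(value);
    else
      out.forwarded.push_back(value);
  }
  return out;
}
```

With the GCC command line shown above, all five `-pass-through=` values would end up in `ignored` and nothing would be forwarded.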

teresajohnson · July 2, 2025, 2:06pm

@MaskRay for thoughts. I found this old issue, which indicates an explicit decision was made to not support GCC LTO from lld: lld can't handle gcc LTO files · Issue #41791 · llvm/llvm-project · GitHub

tobiashieta · July 2, 2025, 3:50pm

I think there might be licensing ramifications for this. Have you looked into if it would be compatible to load a GPLv3 plugin into LLVM?

Tulio_Magno_Quites_M · July 2, 2025, 9:43pm

I am not a lawyer.

As you mentioned, liblto_plugin is licensed under GPLv3. Apache v2.0 and GPLv3 are compatible with each other. There is also agreement on this topic in this thread from 2015.

AFAIU, the situation is very similar to using Clang to build LLD and linking to libstdc++ or libgcc.

With that said, I believe there are some downstream members of the community that may not want to get LLD linked to GPLv3 software. In that case, we could adopt the suggestion from Github user Artoria2e5 (thanks @teresajohnson for the link):

On the CMake configuration side, this may take the form of:

An LLVM_ENABLE_GPL option that allows for a GPL build

An LLVM_ENABLE_LLD_BFD_PLUGIN option that actually controls the feature

It doesn’t hurt to mention these macros would be disabled by default.

Endill · July 3, 2025, 8:19am

Can you clarify what this compatibility actually means? To my understanding, this is a one-way compatibility: GPLv3-licensed projects can use code under Apache 2.0, because the latter doesn’t place any additional restrictions beyond the ones in GPLv3. The opposite is not true, however. Which, I believe, is one of the reasons why GPLv3-licensed test code lives in llvm-test-suite repo to avoid tainting the monorepo.

As has been pointed out to us many times over the years, legal questions regarding the project should be sent to the LLVM Foundation Board to get an answer from an actual lawyer.

Tulio_Magno_Quites_M · July 3, 2025, 2:01pm

Can you clarify what this compatibility actually means?

This is the kind of question that is best answered by a lawyer, which I am not.
With that said, let me try to get you an answer.
The FSF explains it as:

What does it mean to say a license is “compatible with the GPL”?
It means that the other license and the GNU GPL are compatible; you can combine code released under the other license with code released under the GNU GPL in one larger program.

Their explanation is longer and also gives more details about the difference between GPLv2 and GPLv3.

Which, I believe, is one of the reasons why GPLv3-licensed test code lives in llvm-test-suite repo to avoid tainting the monorepo.

Just to make it clear: all the code contributed to the monorepo in this RFC will be licensed Apache v2.
This Apache v2 code, when enabled, will be dynamically linked to a library that is licensed GPLv3.

As has been pointed out to us many times over the years, legal questions regarding the project should be sent to the LLVM Foundation Board to get an answer from an actual lawyer.

AFAIU, this proposal is not doing anything new regarding licensing.
As explained before, linking to GPLv3 libraries already happens when lld gets linked to libstdc++ or libgcc after being compiled by Clang, for example.

It doesn’t hurt to repeat the proposal in my last comment:

The new code would be disabled by default.
In order to enable it, one would have to enable a CMake macro that makes it clear the user wants to link to GPLv3.

tobiashieta · July 3, 2025, 2:05pm

I don’t think this is right. I am no lawyer either - but I am pretty sure it’s different to load GPL code into LLVM compared to linking something where the output contains GPL.

I think you need to contact the foundation and let their lawyer look at this before we move forward with your patch.

thesamesam · July 4, 2025, 1:15am

See The LLVM gold plugin — LLVM 21.0.0git documentation as well wrt the earlier mentioned plugin. I really don’t see that as different to this.

thesamesam · July 4, 2025, 1:47am

I’ll note that mold went on to support these plugins. I think the author changed their mind on design, at least partly.

Tulio_Magno_Quites_M · July 4, 2025, 1:20pm

Thank you! I have followed @tobiashieta 's suggestion and contacted the board.
Let’s wait for their reply.

Meanwhile, I’d appreciate it if the community could also review the technical side of this contribution and weigh in on whether it would be valuable.

teresajohnson · July 4, 2025, 2:51pm

From a practical perspective, who would maintain the GCC support in lld and address user issues, and how would it be tested (e.g. do you plan to add a public build bot)?

Tulio_Magno_Quites_M · July 4, 2025, 8:04pm

I’m available to do that work.

IMHO we can try to reuse Linux builders that already have gcc installed, e.g. llvm-clang-x86_64-gcc-ubuntu in order to run the tests that still need to be developed.
If this is not possible, I volunteer to add a new BuildBot builder (not to be confused with worker/machine).
If the new test ends up requiring a new worker/machine, then I’ll need to look for options.

My main goal is to enable this feature in Fedora and RHEL.
That means we will have daily test runs from GitHub - fedora-llvm-team/llvm-snapshots: Everything to build LLVM snapshots for Fedora/RHEL/CentOS Stream
While this is not ideal, the worst case will still have tests running daily and people reviewing the results.

ruiu · July 8, 2025, 1:05am

mold does indeed support LTO using the LTO plugin mechanism. This is the only way to support LTO for both GCC and LLVM, and I found it to be quite useful at times, because it allows the compiler and linker to be updated independently. OTOH, lld and LLVM must be of the same version to do LTO.

FWIW, I don’t think it makes sense to include GNU binutils’ plugin-api.h just for the constants declared in that file. You should define them yourselves instead so that users don’t have to specify where the file is at build time. Here’s a list of the constants required to support the LTO plugin: mold/src/lto.h at main · rui314/mold · GitHub
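As a rough sketch of what defining the constants locally could look like, the C++ below re-declares a small subset of the plugin API and builds the tag/value array that a linker passes to the plugin's entry point. The enumerator names and values are meant to mirror binutils' plugin-api.h as far as I recall and should be checked against that header; the type names `PluginTag`, `PluginTagValue` and the helper `buildTransferVector` are hypothetical. A real implementation would also add entries carrying the hook-registration callbacks, which are omitted here.

```cpp
#include <string>
#include <vector>

// Subset of the linker plugin API, declared locally instead of including
// plugin-api.h. Values are assumed to match the binutils header.
enum PluginStatus { LDPS_OK = 0, LDPS_NO_SYMS, LDPS_BAD_HANDLE, LDPS_ERR };

enum PluginTag {
  LDPT_NULL = 0,
  LDPT_API_VERSION = 1,
  LDPT_GOLD_VERSION = 2,
  LDPT_LINKER_OUTPUT = 3,
  LDPT_OPTION = 4,
  LDPT_REGISTER_CLAIM_FILE_HOOK = 5,
  LDPT_REGISTER_ALL_SYMBOLS_READ_HOOK = 6,
  LDPT_REGISTER_CLEANUP_HOOK = 7,
  LDPT_ADD_SYMBOLS = 8,
  LDPT_GET_SYMBOLS = 9,
  LDPT_ADD_INPUT_FILE = 10,
  LDPT_MESSAGE = 11,
};

enum PluginOutputFileType { LDPO_REL = 0, LDPO_EXEC, LDPO_DYN, LDPO_PIE };

// One entry of the transfer vector: a tag plus an int, string, or pointer.
struct PluginTagValue {
  PluginTag tag;
  union {
    int val;
    const char *str;
    void *ptr;
  } u;
};

// Builds a minimal, LDPT_NULL-terminated transfer vector for an executable
// link. The strings in pluginOpts must outlive the returned vector, because
// only pointers to them are stored.
std::vector<PluginTagValue>
buildTransferVector(const std::vector<std::string> &pluginOpts) {
  std::vector<PluginTagValue> tv;
  auto addInt = [&tv](PluginTag tag, int v) {
    PluginTagValue e{};
    e.tag = tag;
    e.u.val = v;
    tv.push_back(e);
  };
  addInt(LDPT_API_VERSION, 1);
  addInt(LDPT_LINKER_OUTPUT, LDPO_EXEC);
  for (const std::string &opt : pluginOpts) {
    PluginTagValue e{};
    e.tag = LDPT_OPTION;
    e.u.str = opt.c_str();
    tv.push_back(e);
  }
  addInt(LDPT_NULL, 0); // terminator
  return tv;
}
```

Keeping these declarations in the linker's own source removes the build-time dependency on a binutils installation, at the cost of having to track any future additions to the header by hand.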

Meinersbur · July 8, 2025, 1:04pm

whopr seems to have been developed primarily for gold:

Although this document focuses on gold, a similar approach can also be implemented in GNU ld.^[1]

At the same time gold has been deprecated:

Perhaps the most significant change is the absence of the “gold” linker, which is deprecated and about to disappear entirely. Gold appeared in 2008 with some fanfare as a faster linker, but it has suffered from a lack of maintenance in recent years.^[2]

How are these bfd/gold/whopr and LTO related? What is different to LLVMgold.so? How does the eventual removal of gold affect this?

ruiu · July 9, 2025, 1:51am

My understanding is that the reason GCC’s documents mention gold in the LTO context is because the LTO plugin was originally developed for gold. After that, BFD ld and mold gained support for GCC LTO using the same plugin mechanism, and their support for LTO is now on par with gold’s. So, IIUC, as long as a linker supports the plugin mechanism, it supports all GCC LTO features.

Tulio_Magno_Quites_M · July 10, 2025, 12:40pm

Regarding whopr, I suggest reading this section from GCC Internals in order to understand it:

Users of GCC LTO+gold will be forced to migrate to ld.bfd or mold.
If we implement the feature from this RFC, lld will also become an option.

MaskRay · July 27, 2025, 1:11am

I’m grateful for your work on implementing the GCC LTO feature.
I tried to find similar license discussions. The most relevant one is rustc_codegen_gcc using libgccjit.

github.com/rust-lang/compiler-team

Merge rustc_codegen_gcc backend as compiler/rustc_codegen_gcc

opened 04:08PM - 24 Jun 21 UTC

closed 11:24AM - 08 Jul 21 UTC

antoyo

T-compiler major-change major-change-accepted

# Proposal

[`rustc_codegen_gcc`](https://github.com/antoyo/rustc_codegen_gcc)… is a new code generation backend for rustc using the `libgccjit` library from GCC. (Despite its name, `libgccjit` works for ahead-of-time compilation as well.) `rustc_codegen_gcc` will allow Rust to target the wider set of architectures that GCC supports. It'll also allow us to generate code optimized via GCC, which in some cases can provide better code generation.

This MCP proposes incorporating `rustc_codegen_gcc` into `rust-lang/rust` as `compiler/rustc_codegen_gcc` (using `git subtree`), alongside the other code generation backends. This MCP also proposes gating CI on `rustc_codegen_gcc` building, but *not* on it passing any tests.

`rustc_codegen_gcc` currently passes the entire `core` testsuite; work on the remainder of the testsuite is in progress. `rustc_codegen_gcc` benefits from the existing infrastructure to annotate tests as requiring a specific backend, so that it doesn't attempt to pass LLVM-specific tests.

If this MCP is accepted, we'll subsequently submit PRs adding it to `rust-lang/rust` and adding it to the build process. We'll also make a PR to the highfive bot, to automatically CC @antoyo on changes to `compiler/rustc_codegen_gcc`.

In the future, we'll make a separate proposal to distribute `rustc_codegen_gcc` via `rustup`.

## Licensing

`rustc_codegen_gcc` uses the same license as rustc: dual MIT / Apache-2.0. The `libgccjit` library that `rustc_codegen_gcc` depends on uses the same license as GCC: GPLv3-or-later.

**This won't affect users of rustc at all, and it won't affect distributors of rustc who do not build or distribute the GCC backend.** Distributors of rustc (including the Rust project itself) who do choose to build or distribute the GCC backend will need to provide the full source for their distribution of rustc under a GPL-compatible Open Source license; `rustc` and all its dependencies are under GPL-compatible Open Source licenses, so in practice this just means that distributors of `rustc` who choose to build and distribute the GCC backend need to supply full source code. This does not seem like a practical issue, nor does it change rustc's normal permissive licensing policy, as anyone who wishes to use rustc under a permissive license may simply avoid building or distributing the GCC backend.

We hope that in practice, Linux distributions will build and distribute the GCC backend once it passes enough of the testsuite to be widely useful, and especially once we have targets that depend on it. Other distributors of `rustc` may choose whether to build and distribute the GCC backend based on their needs.

We will never make any portion of rustc other than `rustc_codegen_gcc` depend on `libgccjit`.

Given the value of a GCC backend in expanding Rust's reach to more targets, and thus enabling the use of Rust in projects that need to continue supporting such targets, we believe this represents a reasonable step that will not in practice affect anyone's use, development, or distribution of rustc.

## Authors

@antoyo is the primary author of `rustc_codegen_gcc`, and will continue to maintain it once merged.

@joshtriplett helped with this MCP, and provided guidance and recommendations on licensing.

# Mentors or Reviewers

Not sure who to put here.

# Process

The main points of the [Major Change Process][MCP] are as follows:

* [x] File an issue describing the proposal.
* [ ] A compiler team member or contributor who is knowledgeable in the area can **second** by writing `@rustbot second`.
    * Finding a "second" suffices for internal changes. If however, you are proposing a new public-facing feature, such as a `-C flag`, then full team check-off is required.
    * Compiler team members can initiate a check-off via `@rfcbot fcp merge` on either the MCP or the PR.
* [ ] Once an MCP is seconded, the Final Comment Period begins. If no objections are raised after 10 days, the MCP is considered **approved**.

You can read [more about Major Change Proposals on forge][MCP].

[MCP]: https://forge.rust-lang.org/compiler/mcp.html

# Comments

For better or worse, the FSF holds copyright on libgccjit (FWIW, I used to be OK with this, but I’ve been reconsidering my views on the FSF lately …but that’s a whole other issue).

libgccjit is a GPLv3 library, in particular, it’s essentially a thin wrapper around GCC’s implementation (but designed to work as a shared library rather than a command-line tool). Despite the name, it can also be run as an ahead-of-time compiler, which is how this project is using it.

As I understand it, any host code directly linking with libgccjit needs to comply with the GPLv3, but the target code generated by libgccjit isn’t affected by the GPLv3 (but might link against the target libgcc runtime library, which has its own license); this is analogous to the classic usage of GCC as a command-line tool. My understanding is that the FSF is OK with GCC being used to develop code under other licenses (including proprietary), and GCC’s license only affects that code in-as-much as it links to the target libgcc runtime library (which is under a different license). It might be worth having your counsel check that license.

Note, LLVMgold.so, built from llvm/tools/gold, supports GNU ld, gold, and ar, despite “gold” in its name.
ar requires a symbol table (not useful nowadays; see Archives and --start-lib | MaskRay).

Since LLVMgold.so is a shared object, its dependency scenario differs from adding a dependency to the LLD executable.

Tulio_Magno_Quites_M · August 19, 2025, 7:12pm

I’ve received a reply from the LLVM Foundation board.
They confirmed there is no problem contributing regular Apache 2.0 WITH LLVM-exception-licensed code that links to GPLv3 code; the two licenses are compatible.

They also clarified that it makes no difference whether static linking, dynamic linking, or loading is used when the licenses involved are Apache 2.0 and GPLv3.

rnk · August 20, 2025, 9:53pm

I think there is an important caveat: these licenses are compatible, but when you link together GPLv3 and Apache 2.0 licensed code, the resulting binary is a derived work that carries the requirements of the GPLv3. We wouldn’t want the LLVM build system to unconditionally statically link GPLv3 libraries, or all of our downstream forks would have to abide by the terms of the GPLv3.

It seems to me (as a non-lawyer) that the plugin architecture is an important aspect of what would allow us to add support for loading GCC-derived plugins, because the plugin architecture ensures that our build system doesn’t produce GPLv3 encumbered artifacts.

Tulio_Magno_Quites_M · August 21, 2025, 5:31pm

Agreed. That’s why I’m planning to follow the proposal from GitHub user Artoria2e5, which disables building this code by default and only builds it if the user explicitly requests both “GPL code” and “GCC LTO support”.

I didn’t understand this part. How does a plugin ensure the build system doesn’t produce GPLv3 encumbered artifacts?
