Bootstrapping and the Runtimes Build: do I understand it correctly?

kjcamann · June 13, 2022, 6:33pm

Hello everyone!

I am trying to wrap my head around the “runtimes build” design and how it applies to what I’m trying to do: bootstrap a toolchain that can act as either an X86 host compiler or as a cross-compiler for a baremetal ARM embedded system.

IIRC, the rationale of the runtimes build is that you want to build the runtimes with the just-built compiler, to produce a fully coherent toolchain. I like this idea, but I don’t see how it can currently work in one stage, because libc gets in the way.

I would think the process could go like this:

1. Build clang
2. Build the builtins from compiler-rt with LLVM_BUILTIN_TARGETS=default;aarch64-unknown-elf and in my case also BUILTINS_aarch64-unknown-elf_COMPILER_RT_BAREMETAL_BUILD=ON
3. Build libc so that the downstream runtimes (e.g., libc++) can actually succeed, and set RUNTIMES_aarch64-unknown-elf_CMAKE_SYSROOT to point to it
4. Build the other runtimes
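
As a rough sketch, the cache variables for such a one-stage configure might look like the following. The script only prints the flags and runs nothing. The sysroot path is a hypothetical placeholder that would already have to contain a libc (which is exactly the problem), and a real bare-metal libc++ build would likely need further options not shown here.

```shell
#!/bin/sh
# Sketch: print (do not run) the flags for a single-invocation configure.
# SYSROOT is a hypothetical path that must already contain a libc.
TARGET=aarch64-unknown-elf
SYSROOT=${SYSROOT:-/opt/sysroot/$TARGET}

one_stage_flags() {
  # One flag per line; values containing ';' need quoting on a real command line.
  printf '%s\n' \
    "-DLLVM_ENABLE_PROJECTS=clang" \
    "-DLLVM_ENABLE_RUNTIMES=compiler-rt;libcxx;libcxxabi;libunwind" \
    "-DLLVM_BUILTIN_TARGETS=default;$TARGET" \
    "-DBUILTINS_${TARGET}_COMPILER_RT_BAREMETAL_BUILD=ON" \
    "-DLLVM_RUNTIME_TARGETS=default;$TARGET" \
    "-DRUNTIMES_${TARGET}_CMAKE_SYSROOT=$SYSROOT"
}

echo "cmake -G Ninja -S llvm-project/llvm -B build, plus:"
one_stage_flags
```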

The trouble is step 3 (building libc), since I am not sure how to orchestrate building a libc from within the LLVM CMake build system. I know LLVM has a libc of its own, but I am guessing it is not yet at the stage where it can exist completely freestanding (i.e., without needing another libc that it interposes). And even if it is, I don’t think being super lean for embedded use was a design goal in any case.

So my question is: have I understood the situation correctly?

And what is the best way to deal with it? I can think of a few ways, e.g., a wrapper script that does three stages:

Stage 1: run CMake the first time, building only a clang that targets both the host and AArch64, with compiler-rt as the only runtime and the builtins enabled for AArch64
Stage 2: (outside of LLVM) use the new AArch64-capable clang and its builtins to build an embedded libc and stash it somewhere
Stage 3: run CMake a second time, building everything, now with the AArch64 sysroot pointing at that libc
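
A minimal sketch of such a wrapper is below, assuming an llvm-project checkout in the current directory. The build directory names, the sysroot location, and `./build-libc.sh` with its options are all hypothetical placeholders for whatever libc build is used. Commands are printed rather than executed unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# Three-stage wrapper sketch. Prints commands unless DRY_RUN=0.
set -eu
TARGET=aarch64-unknown-elf
SYSROOT=${SYSROOT:-$PWD/sysroot}   # hypothetical location for the libc
DRY_RUN=${DRY_RUN:-1}

run() {
  if [ "$DRY_RUN" = 1 ]; then echo "+ $*"; else "$@"; fi
}

stage1() {  # clang plus the compiler-rt builtins only
  run cmake -G Ninja -S llvm-project/llvm -B build-stage1 \
    -DLLVM_ENABLE_PROJECTS=clang \
    -DLLVM_ENABLE_RUNTIMES=compiler-rt \
    "-DLLVM_BUILTIN_TARGETS=default;$TARGET" \
    "-DBUILTINS_${TARGET}_COMPILER_RT_BAREMETAL_BUILD=ON"
  run ninja -C build-stage1 clang builtins
}

stage2() {  # outside of LLVM: build an embedded libc (hypothetical script)
  run ./build-libc.sh --cc "$PWD/build-stage1/bin/clang" \
    --target "$TARGET" --prefix "$SYSROOT"
}

stage3() {  # second CMake run: everything, with the sysroot now populated
  run cmake -G Ninja -S llvm-project/llvm -B build-stage3 \
    -DLLVM_ENABLE_PROJECTS=clang \
    "-DLLVM_ENABLE_RUNTIMES=compiler-rt;libcxx;libcxxabi;libunwind" \
    "-DLLVM_BUILTIN_TARGETS=default;$TARGET" \
    "-DBUILTINS_${TARGET}_COMPILER_RT_BAREMETAL_BUILD=ON" \
    "-DLLVM_RUNTIME_TARGETS=default;$TARGET" \
    "-DRUNTIMES_${TARGET}_CMAKE_SYSROOT=$SYSROOT"
  run ninja -C build-stage3
}

stage1
stage2
stage3
```

Keeping each stage in its own function makes it easy to rerun only the stage that failed.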

Thanks for your help!
Ken

efriedma-quic · June 13, 2022, 6:51pm

The “runtimes” build is basically a shortcut to allow building both the compiler and the associated runtimes in one CMake command. If you’re trying to build an external libc as part of your build process, this probably won’t work, as you’ve noted.

I think what most people do for scenarios like that is use separate CMake invocations for the runtimes. See, for example, “build it separately” directions at https://compiler-rt.llvm.org/ .
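
For illustration, a separate runtimes configure against an already-built clang might look roughly like this. It is only a sketch: the clang location and sysroot are hypothetical paths, the command is printed rather than run, and a bare-metal libc++ will probably need additional options beyond these.

```shell
#!/bin/sh
# Sketch: configure the runtimes on their own with an already-built clang.
# CLANG_BIN and SYSROOT are hypothetical paths; the command is only printed.
TARGET=aarch64-unknown-elf
CLANG_BIN=${CLANG_BIN:-$PWD/build/bin}
SYSROOT=${SYSROOT:-$PWD/sysroot}

runtimes_configure() {
  echo cmake -G Ninja -S llvm-project/runtimes -B build-runtimes \
    "'-DLLVM_ENABLE_RUNTIMES=libcxx;libcxxabi;libunwind'" \
    "-DCMAKE_C_COMPILER=$CLANG_BIN/clang" \
    "-DCMAKE_CXX_COMPILER=$CLANG_BIN/clang++" \
    "-DCMAKE_C_COMPILER_TARGET=$TARGET" \
    "-DCMAKE_CXX_COMPILER_TARGET=$TARGET" \
    "-DCMAKE_SYSTEM_NAME=Generic" \
    "-DCMAKE_SYSROOT=$SYSROOT" \
    "-DCMAKE_TRY_COMPILE_TARGET_TYPE=STATIC_LIBRARY"
}

runtimes_configure
```

Because the compiler is built once and then reused, only this step has to be repeated when the libc or sysroot changes.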

“run CMake a second time, building everything” might also work, but that means you’re building the compiler twice, so your builds are roughly twice as slow.

It might be possible to add some CMake glue to invoke an external build system specifically to build libc at the right point in the build. But I expect most people building toolchains would prefer to use separate invocations; it’s much easier to understand if something goes wrong.
