Why do we statically link all LLVM libraries into every executable?

Hi

I found that basically all LLVM libraries are statically linked into each executable and into LLVMgold.so. This makes the clang/llvm packages larger and larger, with a lot of duplicated code; if I build a debug version, the disk space required is even larger. Is there any particular reason to keep doing it this way? If we split out a few shared libraries, something like libclang.so and libllvm.so, and linked all the executables and LLVMgold.so against those, a lot of space could be saved and loading performance could be improved.
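
One way to see this on a default Linux build is with ldd (the build path below is just an example):

    $ ldd build/bin/clang
    # only system libraries appear; no libLLVM*.so or libclang*.so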

Yin

There is a build option to do exactly that. It comes at a significant
price in startup time; e.g. clang will take 10x as long to build a small
example.

Joerg

Namely, BUILD_SHARED_LIBS=ON.

I find it very useful for dev builds!
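
For reference, a configure line for such a build might look like this (the source path is a placeholder):

    $ cmake -G Ninja -DBUILD_SHARED_LIBS=ON -DCMAKE_BUILD_TYPE=Debug ../llvm
    $ ninja clang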

Is this to process runtime relocations or run constructors? I wonder if Prelink or ElfHack could help.

Runtime relocations, I would imagine (global ctors would have to run in
either mode - so shouldn't represent a difference, I would think?)
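
On Linux/glibc, one way to check would be to ask the dynamic loader for its own statistics, e.g. something like this (the exact output format varies between glibc versions):

    $ LD_DEBUG=statistics ./bin/clang --version 2>&1 | grep -i reloc
    # counts the relocations processed at startup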

Hi,

Thank you for explaining and providing the option. I will give it a try.

10x slower… on Linux? Would limiting the number of global symbols exposed help the situation?

Yin

Properly separating global / default symbols from internal / hidden symbols would take a substantial amount of effort. To put the amount of effort in perspective, there are more than 700 LLVM headers that provide the interface between the LLVM static libraries. There are more than 300 Clang headers to provide the interface between the Clang static libraries.

I think the change could be done, and it would be valuable, but it isn’t a quick change. You would also get to deal with all the non-portable platform peculiarities.
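
For a sense of the mechanics involved: the usual approach is to compile everything with -fvisibility=hidden and then annotate only the intended public interface. A minimal sketch (MYLIB_API and the function names are illustrative, not actual LLVM names):

    /* compile with -fvisibility=hidden */
    #define MYLIB_API __attribute__((visibility("default")))

    MYLIB_API int mylib_public_entry(int x); /* exported from the .so */

    int mylib_internal_helper(int x); /* hidden: kept out of the dynamic
                                         symbol table, so intra-library
                                         calls need no PLT indirection */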

A pity Prelink does not support this use-case (prelinking just a subset of loaded libraries).

10x slower seems like an exaggeration. I tried doing a shared and non-shared build of LLVM and running the LLVM+Clang test suites. The shared library version used 40% more CPU time[1]. I did not compare to a PIC-but-not-shared-library build yet, which would give another interesting data point (i.e. how much do we lose from PC-relative addresses rather than absolute, vs how much do we lose from dynamic relocations vs static).

David

[1] Note: I only did it once, take these results with a grain of salt, though given that they used several hours of CPU time each, there’s probably not a huge variation expected over multiple runs.
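
If someone wants to try that configuration: if I remember the CMake options correctly, the PIC-but-not-shared build should be reachable with something like

    $ cmake -G Ninja -DLLVM_ENABLE_PIC=ON -DBUILD_SHARED_LIBS=OFF ../llvm

but please double-check the option names.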

Test program is just "int main(void) { return 0; }":

Monolithic clang as used by NetBSD's cross build system: 0.008s
Shared clang using normal cmake build: 0.139s

Both are optimised builds with asserts enabled.
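
To reproduce, something along these lines should do (the binary names here are placeholders for the two builds):

    $ echo 'int main(void) { return 0; }' > t.c
    $ time monolithic-clang -c t.c
    $ time shared-clang -c t.c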

Joerg

Just curious, was this HDD or SSD?

SSD, but pretty much irrelevant due to a hot cache.

Joerg