[RFC/PSA] Changing the shadow call stack register on RISC-V

asb · March 27, 2023, 9:22am

There’s an active discussion on D146463 about changing the register used for the shadow call stack (the patch description does a good job of summarising things so I won’t repeat here).

Our working assumption is that the shadow call stack register has little/no real-world use and so such a change wouldn’t be disruptive, but please speak up if it would be problematic for you.

kito-cheng · March 28, 2023, 2:11am

Another proposal from me is using gp as platform register: [RFC] Relax gp could be platform specific register rather than reserved for… by kito-cheng · Pull Request #371 · riscv-non-isa/riscv-elf-psabi-doc · GitHub

Some advantage on taking gp as platform register rather than other GPRs:

Compiler doesn’t use gp register anywhere for now.
All assembly files (which conform with current ABI) didn’t use that except the __global_pointer$ initialization code in CRT files.
The main user is linker, linker will use that to perform linker relaxation, and we already have the command option to tune that off.

Potential issues:

Loss the code size and performance gain from gp relaxation
- The most gain from gp relaxation is embedded application, it’s different target audience as SCS, so this should not blocker issues.
- Android is an example, it’s already disable GP relaxation at all, so we don’t have any loss for this case.
Will it break any existing platform?
- Treat gp as platform register is optional, it’s still default use as gp relaxation, so NO breakage on existing platform, but give the freedom of the platform to use gp register as other purpose if they don’t want gp relxation.
- Added an attribute to let linker to help mixing up different gp usage objects, also linker could check that attribute to make sure gp relaxation is do-able or not.

ilovepi · March 28, 2023, 9:59pm

We do not want to slow this process down, because it is important that we come to a resolution as soon as possible, but we do want to raise these concerns now, while there is still time to take them into account.

Ideally, projects that place a premium on code size wouldn’t have to forgo code size savings from global relaxation if they want to use SCS, or make use of the platform register. We see no fundamental reason why users in the embedded space who care deeply about code size would not fall into this category, and it would be nice if they could also make use of the feature without being required to drop global relaxation.

As mentioned in the sig-toolchain meeting and the ps ABI issues, there are many benefits to using gp, but before we make a choice we should be sure that we are explicit about the tradeoffs are why we believe they are worth making.

kito-cheng · March 30, 2023, 7:23am

Just cross post my reply here so that anyone who not following the RISC-V psABI repo can know the follow up discussion:

github.com/riscv-non-isa/riscv-elf-psabi-doc

Specify a platform reserved register

opened 03:41PM - 20 Mar 23 UTC

closed 12:32PM - 17 Apr 23 UTC

appujee

ARM reserves x18 for platform but RISC-v doesn't specify a platform register. … https://developer.arm.com/documentation/den0024/a/The-ABI-for-ARM-64-bit-Architecture/Register-use-in-the-AArch64-Procedure-Call-Standard/Parameters-in-general-purpose-registers Recent changes in llvm has reserved the same `x18` for Android and Fuschia platforms to align with the **name** w.r.t. ARM spec. But x18 may not be an ideal choice for RISC-V because: - it is the 3rd callee saved register - `-msave-restore` does not interact well if `x18` gets reserved. Some discussion has been conitnuing here: https://github.com/google/android-riscv64/issues/72 but it would be ideal to have a spec or at least a recommendation on choice of platform registers and trade-offs around that In the current setup `x27` appears to be a better choice as this would reduce: - interaction with `-msave-restore` - amount of work in assembler code out there. `X18` is used in more places (3rd callee saved register) than X27.

Thanks for the comment, SCS is kind of special is that eventually will break the ABI (for RISC-V) since that require one extra reserved register, so that give we few more freedom to having more option than other ABI issues.

Of cause the why it become an issue that must did a ABI breakage change is we didn’t specify a platform register at beginning, but anyway the boat sailed.

So that’s back to the SCS first, I think we have three options:

Pick a GPR as SCS
Re-define gp to allow that use other than gp relaxation.
Use Zisslpcfi extension, that provide dedicated instruction and CSRs to implement SCS.

Pick a GPR as SCS

There are actually two candidate during the discussion x18 and x27, but x18 is kind of many potential issues, so now we are discussion the other candidate, and then x27 has purposed.

x27 obviously better than x18 just like @appujee has listed on the first post.

but it’s still has low probability might screwed up in some asm code since it was not reserved before, so I am not prefer to pick up a non-reserved register if possible.

Re-define gp to allow that use other than gp relaxation.

Already listed several reason in #371, so not duplicate here, so just jump to the concern @ilovepi raised, what if user want SCS and gp relaxation, I think it’s the most potential drawback of this proposal.

	GP Relax	No GP relax
SCS Enable	NOT OK	OK
SCS disable	OK	OK

What’s the possible solution if people want more code size saving AND SCS?

Use Zisslpcfi, that would be most simple but require HW support.
GlobalMerge optimization pass from LLVM*, it could archive similar optimization like gp relaxation, but for local, that should be able to improve by LTO.
Proposal around the EABI/deviations: using tp as gp (this can be only apply on those system not require thread)

NOTE: * GCC has similar stuffs but called section anchor optimization.

Use `Zisslpcfi` extension, that provide dedicated instruction and CSRs to implement SCS.

Okay, that should be everyone happy, no extra reserved register needed since the extension has provide dedicated one, the only issue is that require HW has implement that.

Compare

Just drop Zisslpcfi from the comparison table since it’s not the point in this thread.

	Option 1	Option 2
Used in existing asm code?	Yes *1	No
Impact code size	Yes, but very minor	Yes *2
Break ABI	Hard break	Soft break compare to option 1

So based on the comparison table and several reason listed in #371, I believe we should go with option 2

*1: google/android-riscv64#78 Android have an issue to tracked where has use the SCS register candidate.
*2: We have some compiler optimization to make up the gap like GlobalMerge optimization, and be noted there is no loss on those system already disable GP relaxation.

Topic		Replies	Views
RISC-V calling convention implementation in clang: tp and gp registers RISCV	5	589	January 4, 2024
Software shadow call stack run-time support for RISC-V Sanitizers riscv	4	204	August 7, 2025
MIPS & GP register LLVM Dev List Archives	9	179	August 17, 2012
Question about RISCV gp-relaxtion LLVM Dev List Archives	3	147	October 19, 2021
Named register variables GNU-style LLVM Dev List Archives	28	178	April 27, 2014

[RFC/PSA] Changing the shadow call stack register on RISC-V

Pick a GPR as SCS

Re-define gp to allow that use other than gp relaxation.

Use Zisslpcfi extension, that provide dedicated instruction and CSRs to implement SCS.

Compare

Related topics

Use `Zisslpcfi` extension, that provide dedicated instruction and CSRs to implement SCS.