Find instruction's offset

Fami_H · January 9, 2017, 10:07pm

Hi,
Is there a way to get instruction’s offset at compile time with llvm for ARM?
I am trying to create a map between instructions at compile time and this run-time info. Since PC is a relative value, I am trying to use the instruction’s offset as a constant property of instruction to create this map. I think offset information should be available to create the executable, if so where to find it?
Thank you for your help,
Fami

Fami_H · January 23, 2017, 5:29pm

I don’t know if my question was super easy or maybe it is not clear. I’m still in need of the answer and appreciate any help.When LLVM creates assembly code it should somehow create instructions’ program counter (offset) right? how to get this information?

Thank you,
Fateme

Jeremy_Lakeman1 · January 23, 2017, 11:57pm

Debug metadata is generally the only way to link a byte of machine code back to the original source location. So that tools which understand the source language can help you see how your program runs. But even that is inaccurate.

There is no other metadata linking assembly instructions back to any Value* in the intermediate representation. The goal of every optimisation is to make the whole program faster, not to keep track of how we got here.

There is often a huge difference between how you imagine something works, and how it actually works.

If you want a better answer, try to explain what you want to do and why. We aren’t mind readers, and we can’t piece together what you want from a description of how you imagine something works.

Eli_Friedman · January 23, 2017, 11:59pm

Your question isn't really clear what kind of offset you need, or exactly you're planning to do with the offset.

In assembly, if you have two labels in the same section, you can write the offset between them using subtraction, e.g. ".L3 - .L1". The assembler will resolve this to an actual number.

-Eli

Bruce_Hoult · January 24, 2017, 12:03pm

LLVM creates something like assembly language (or even real assembly language) and doesn’t know the exact offset of an instruction within the function (or section).

The assembler might create different sized instructions – or even multiple instructions – from something that LLVM just thinks of as “an instruction”. For example because of different instruction encodings or even instruction sequences being needed because of the size of immediate values or branch offsets (often not even known until link time). Even things such as whether or not a REX prefix is needed on AMD64 depends on whether some register mentioned in the instruction is in the high 8 registers – LLVM could pay attention to that, but I’d suspect it doesn’t.

On some platforms, instructions within a function can even be added or deleted by the linker.

Your best best may be to take the final linked program and run objdump on it.

Fami_H · January 30, 2017, 4:24am

Bruce,
Objdump was exactly what I needed!
Thanks everyone. Sorry If I my question was not clear enough.

Topic		Replies	Views
Is it possible to get the exact PC address of an instruction? LLVM Dev List Archives	1	100	June 9, 2020
llvm-mc assembler, GNU as, and pc-relative branches for Arm/AArch64/Mips LLVM Dev List Archives	3	128	January 10, 2018
Getting started with LLVM Passes LLVM Dev List Archives	2	68	July 15, 2005
[LLVM::IR] How to retrieve llvm::Instruction address after JIT code generation? LLVM Dev List Archives	2	79	March 6, 2017
LLVM Metadata to Dwarf tags LLVM Dev List Archives	1	70	April 11, 2012

Find instruction's offset

Related Topics