llvm-objdump: syntax highlighting based on rich disassembly

Seiya_Nuta · July 8, 2019, 8:20am

Hi all,

I'm going to implement syntax highlighting in llvm-objdump based on
Rich Disassembly [1], and I'd like to hear your comments on how I should
implement it. I have two ideas for implementing the feature:

(a) Make MCInstPrinter return a well-typed value and traverse it in
llvm-objdump to highlight the disassembly.
(b) Parse the rich disassembly output string in llvm-objdump and
highlight the disassembly.

Making MCInstPrinter return a "well-typed" marked-up value, much like an
abstract syntax tree, instead of writing an annotated string into a
raw_ostream sounds preferable to me. However, it would
involve large changes to the existing MCInstPrinter implementations.
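
To make option (a) concrete, here is a minimal sketch in Python rather than
C++, purely for illustration. The node types `Text` and `Markup`, the
`render` and `plain` helpers, and the colour palette are all hypothetical
names invented for this example; none of them are existing LLVM classes or
APIs.

```python
from dataclasses import dataclass, field
from typing import List, Union


# Hypothetical node types: a leaf of literal text, and a tagged node
# ("reg", "imm", "mem", ...) that may contain further nodes.
@dataclass
class Text:
    value: str


@dataclass
class Markup:
    kind: str
    children: List[Union[Text, "Markup"]] = field(default_factory=list)


Node = Union[Text, Markup]

# Assumed ANSI colours per kind; the palette is arbitrary.
COLORS = {"reg": "\x1b[36m", "imm": "\x1b[33m", "mem": "\x1b[35m"}
RESET = "\x1b[0m"


def plain(node: Node) -> str:
    """Flatten the tree back to ordinary disassembly text."""
    if isinstance(node, Text):
        return node.value
    return "".join(plain(child) for child in node.children)


def render(node: Node, active: str = "") -> str:
    """Traverse the tree and wrap each tagged node in its colour.

    `active` is the colour of the enclosing node, re-emitted after a
    nested node ends so the outer colour continues.
    """
    if isinstance(node, Text):
        return node.value
    color = COLORS.get(node.kind, active)
    body = "".join(render(child, color) for child in node.children)
    if not color:
        return body
    return f"{color}{body}{RESET}{active}"


if __name__ == "__main__":
    operand = Markup("mem", [Text("8("), Markup("reg", [Text("%rsp")]), Text(")")])
    print(plain(operand))
    print(render(operand))
```

The point of the sketch is that the consumer never parses text: it only
walks typed nodes, so the same tree can be printed plainly or with colours.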

In contrast, parsing the rich disassembly output in llvm-objdump
sounds a bit awkward, but we wouldn't need to change MCInstPrinter at
all. That said, parsing the text will surely degrade disassembly
performance, so we should disable parsing and highlighting by
default. I wrote and uploaded a prototype of this [2].
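
For illustration only, option (b) could look roughly like the Python sketch
below. It is not the code in the prototype. It assumes the marked-up form
described in [1], where an annotation looks like `<reg:%rax>` and
annotations can nest; it keeps only the first name before any comma in the
tag and does not handle escaped angle brackets. The `highlight` function
and the colour table are hypothetical names.

```python
import re

# Assumed ANSI colours per tag name; the palette is arbitrary.
COLORS = {"reg": "\x1b[36m", "imm": "\x1b[33m", "mem": "\x1b[35m"}
RESET = "\x1b[0m"

# Opening of an annotation: '<', a tag (possibly with comma-separated
# modifiers), then ':'.
_OPEN = re.compile(r"<([A-Za-z][A-Za-z0-9_,]*):")


def highlight(marked_up: str) -> str:
    """Replace marked-up annotations with ANSI colour codes."""
    out = []
    stack = []  # effective colour of each currently open annotation
    i = 0
    while i < len(marked_up):
        m = _OPEN.match(marked_up, i)
        if m:
            kind = m.group(1).split(",")[0]
            enclosing = stack[-1] if stack else ""
            color = COLORS.get(kind, enclosing)
            stack.append(color)
            out.append(color)
            i = m.end()
        elif marked_up[i] == ">" and stack:
            stack.pop()
            out.append(RESET)
            # Restore the colour of the enclosing annotation, if any.
            out.append(stack[-1] if stack else "")
            i += 1
        else:
            # Ordinary text, including '<' or '>' outside an annotation.
            out.append(marked_up[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    print(highlight("movq <imm:$42>, <reg:%rax>"))
    print(highlight("leaq <mem:8(<reg:%rsp>)>, <reg:%rax>"))
```

Even this small scanner touches every character of every instruction, which
is the per-instruction cost that motivates keeping the feature off by
default.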

Do you have any thoughts?

Thanks,
Seiya

[1] https://llvm.org/docs/MarkedUpDisassembly.html
[2] https://reviews.llvm.org/D64311

Bigcheese · July 9, 2019, 11:59pm

I really dislike the idea of having llvm-objdump do any assembly parsing.

Another solution would be to have MCInstPrinter also be able to store (range, semantic) pairs that can then be used to insert highlighting.
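
As a rough illustration of the (range, semantic) idea, sketched in Python
rather than C++: the printer would emit the plain instruction text plus a
side list of spans, and the tool would apply colours from that list. The
`Span` layout (start, end, kind), the `apply_spans` helper, and the colour
table are assumptions made for this example, not an existing MCInstPrinter
interface. Nested or overlapping spans are deliberately rejected to keep
the sketch short.

```python
from typing import List, Tuple

# (start, end, kind): half-open character range into the plain text.
Span = Tuple[int, int, str]

# Assumed ANSI colours per semantic kind; the palette is arbitrary.
COLORS = {"reg": "\x1b[36m", "imm": "\x1b[33m", "mem": "\x1b[35m"}
RESET = "\x1b[0m"


def apply_spans(text: str, spans: List[Span]) -> str:
    """Insert colour codes around each span without parsing `text`."""
    out = []
    pos = 0
    for start, end, kind in sorted(spans):
        if start < pos or end < start or end > len(text):
            raise ValueError("overlapping or invalid spans are not handled here")
        out.append(text[pos:start])
        piece = text[start:end]
        color = COLORS.get(kind)
        out.append(f"{color}{piece}{RESET}" if color else piece)
        pos = end
    out.append(text[pos:])
    return "".join(out)


if __name__ == "__main__":
    text = "movq $42, %rax"
    spans = [(5, 8, "imm"), (10, 14, "reg")]
    print(apply_spans(text, spans))
```

With this shape the plain text stays exactly what the printer already
produces, and callers that do not want highlighting can ignore the span
list.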

Michael Spencer

Seiya_Nuta · July 24, 2019, 7:20am

Hi all,

I've uploaded a series of patches for this feature (you can see some
screenshots on Phabricator):

https://reviews.llvm.org/D65191
