Can I use clang to convert C/C++ code to a list of tokens?

Hi all,

I’m wanting to take some C/C++ code and convert it to a list of tokens. Tokens that a compiler would use. I’m interested in the literal name of the token, specifically. If I could get something that’d return to me something along the lines of “this code you provided to me contains the following in this order: variable, plus operator, minus operator, if statement start, if statement end” etc. That is what I’m looking for.

Is this something that I can use clang to help me with?


Yes. You might want to look at libclang, which exposes a C interface for
parsing source code.

--Kyle Sluder