[PSA] program point semantics change

cxy · September 28, 2024, 12:05am

This is a friendly PSA to inform everyone that the semantics of program points in MLIR dataflow analysis will be modified by this PR([mlir] [dataflow] unify semantics of program point by cxy-1993 · Pull Request #110344 · llvm/llvm-project · GitHub).

The concept of a ‘program point’ in the original data flow framework is ambiguous.It can refer to either an operation or a block itself. This representation has different interpretations in forward and backward data-flow analysis. In forward data-flow analysis, the program point of an operation represents the state after the operation, while in backward data flow analysis, it represents the state before the operation. When using forward or backward data-flow analysis, it is crucial to carefully handle this distinction to ensure correctness.

This patch refactors the definition of program point, unifying the interpretation of program points in both forward and backward data-flow analysis.

How to integrate this patch?

For dense forward data-flow analysis and other analysis (except dense backward data-flow analysis), the program point corresponding to the original operation can be obtained by getProgramPointAfter(op), and the program point corresponding to the original block can be obtained by getProgramPointBefore(block).

For dense backward data-flow analysis, the program point corresponding to the original operation can be obtained by getProgramPointBefore(op), and the program point corresponding to the original block can be obtained by getProgramPointAfter(block).

NOTE: If you need to get the lattice of other data-flow analyses in dense backward data-flow analysis, you should still use the dense forward data-flow approach. For example, to get the Executable state of a block in dense backward data-flow analysis and add the dependency of the current operation, you should write:

getOrCreateFor<Executable>(getProgramPointBefore(op), getProgramPointBefore(block))

In case above, we use getProgramPointBefore(op) because the analysis we rely on is dense backward data-flow, and we use getProgramPointBefore(block) because the lattice we query is the result of a non-dense backward data flow computation.

related dsscussion: [RFC] Unify the semantics of program points - #8 by cxy

All relevant discussions are welcome.

Topic		Replies	Views
[RFC] Unify the semantics of program points MLIR mlir	8	398	August 24, 2024
[RFC] A DataFlow Analysis Framework MLIR	42	4044	June 30, 2022
DataFlowAnalysis.h questions MLIR	6	679	May 12, 2021
How to deal with control-flow or data-flow dependency analysis? Beginners mlir	0	62	November 25, 2024
Learning MLIR DataFlowAnalysis MLIR	0	284	December 5, 2022

[PSA] program point semantics change

Related topics