Auto-generated checks with inexact FP results

andykaylor · June 30, 2023, 6:10pm

In early May @nikic committed a change to regenerate the checks in the InstCombine tests. Today, I came across an error that was introduced by that change. This looks like a shortcoming in the script that generates the checks.

The test with the problem is llvm/test/Transforms/InstCombine/pow-exp.ll. Before the change, this test contained the following lines:

; Do not change 0xBFE0776{{.*}} to the exact constant, see PR42740
; CHECK-NEXT:    [[MUL:%.*]] = fmul nnan ninf afn double [[E:%.*]], 0xBFE0776{{.*}}

The update script kept the comment, but made the change that the comment said not to make. Nikita’s change updated a huge number of tests, so I don’t think it would have been reasonable to expect anyone to notice this while reviewing the changes.

I don’t know if there are any other cases like this. I would expect so. In this case, the test is constant folding a call to pow(). The problem is that the implementation of this function used by the compiler doesn’t necessarily return correctly rounded results, so checking for the exact result found when updating the test can lead to failures when the test is run with a compiler built against a different math runtime library.
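To make the failure mode concrete, here is a minimal Python sketch of why the wildcard matters. It assumes two math libraries return neighbouring doubles for the same folded call. The bit pattern below is hypothetical (only the 0xBFE0776 prefix comes from the test), and Python's re module stands in for FileCheck's {{.*}} matching.

```python
import re
import struct


def bits_to_double(bits: int) -> float:
    """Reinterpret a 64-bit integer as an IEEE-754 double."""
    (value,) = struct.unpack(">d", struct.pack(">Q", bits))
    return value


def double_to_llvm_hex(value: float) -> str:
    """Format a double as 0x plus 16 uppercase hex digits, as LLVM IR does
    for doubles that have no exact short decimal form."""
    (bits,) = struct.unpack(">Q", struct.pack(">d", value))
    return f"0x{bits:016X}"


# Hypothetical folded result: only the 0xBFE0776 prefix is taken from the test.
HYPOTHETICAL_BITS = 0xBFE0776123456789
result_a = bits_to_double(HYPOTHETICAL_BITS)
# A neighbouring double (one ulp away), as a differently rounded pow() might return.
result_b = bits_to_double(HYPOTHETICAL_BITS + 1)

# What a regenerated check would pin: the exact constant from one machine.
exact_check = re.compile(re.escape(double_to_llvm_hex(result_a)))
# What the hand-written check pinned: the leading digits plus a wildcard.
prefix_check = re.compile(r"0xBFE0776.*")

for result in (result_a, result_b):
    text = double_to_llvm_hex(result)
    print(text,
          "exact:", bool(exact_check.fullmatch(text)),
          "prefix:", bool(prefix_check.fullmatch(text)))
```

The exact check accepts only the value produced on the machine that regenerated the test, while the prefix check accepts both neighbours.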

Can we change update-test-checks.py to have it preserve wildcards in the constant checks?

efriedma-quic · June 30, 2023, 6:30pm

I think this is user error; the point of auto-generated checks is that they’re auto-generated. If you’re hand-editing the checks, you should remove the “autogenerated by update-test-checks.py” line at the top of the file. And maybe add a note explaining why, if the checks were originally generated.

Maybe it makes sense to make update-test-checks.py try to find CHECK lines that don’t appear to be automatically generated, and refuse to update them unless the user explicitly requests it.

nikic · June 30, 2023, 6:45pm

This is correct. The “autogenerated” line is used as a canary to determine whether it’s safe to regenerate the test.

The precise UTC output evolves over time, so it’s probably not possible to robustly determine whether the output is auto-generated.

For this specific case, I’d probably recommend just adjusting the involved constants to values that are not subject to rounding issues.

pogo59 · June 30, 2023, 8:40pm

Now I’m curious: Is there a way to prevent UTC from auto-generating checks for a file? Because in a case like this where something is a little flaky, you wouldn’t want someone to come along and ruin your carefully hand-written checks.

efriedma-quic · June 30, 2023, 8:54pm

There’s an --update-only flag that tells UTC to only update tests that are auto-generated.

If you write something like ; NOTE: Assertions have been autogenerated by hand, UTC will refuse to update it. This is meant to prevent accidentally switching from update_llc_test_checks to update_test_checks, or something like that, but you can also use it like this. Maybe we could add better syntax for it.
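As a rough illustration of the behaviour described here, the sketch below models the first-line check in Python. It is not the actual UTC code: the function name should_regenerate and its parameters are hypothetical, and the logic is inferred only from the descriptions in this thread.

```python
# Text that marks a test as generated, as quoted in this thread.
ADVERT = "NOTE: Assertions have been autogenerated by "


def should_regenerate(first_line: str, script_name: str,
                      update_only: bool = False) -> bool:
    """Hypothetical model of the canary check, not the real implementation.

    - If the note is present, regenerate only when it names this script.
    - If the note is absent, regenerate unless update_only is set.
    """
    if ADVERT in first_line:
        return script_name in first_line
    return not update_only


generated = "; NOTE: Assertions have been autogenerated by utils/update_test_checks.py"
by_hand = "; NOTE: Assertions have been autogenerated by hand"
plain = "; RUN: opt < %s -passes=instcombine -S | FileCheck %s"

print(should_regenerate(generated, "update_test_checks.py"))            # note names the script
print(should_regenerate(by_hand, "update_test_checks.py"))              # "by hand" names no script
print(should_regenerate(plain, "update_test_checks.py", update_only=True))
```

In this model the "by hand" note works for the same reason a note naming update_llc_test_checks.py would: the script does not find its own name on the first line, so it leaves the file alone.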

pogo59 · June 30, 2023, 9:23pm

Hmmm nice trick, thanks. It would be good if there was something that even --force would respect, though. Accidents happen.

andykaylor · June 30, 2023, 9:29pm

Thanks everyone for the input. I’m about to go on vacation, but I’ll try to remember to update this test when I get back.

pogo59 · July 3, 2023, 6:46pm

See D154383
