Major ARM bots failure

Renato_Golin1 · December 5, 2014, 10:46pm

Folks,

I'm not sure what happened, but some commit in this build made Clang
run forever on ARM:

http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15/builds/2203

The missing commits are to the sanitizers and they are not even
checked out on this bot, so I don't think it has anything to do with
that. It's also unlikely to be something on the Hexagon back-end,
since all changes are self-contained.

Any ideas?

I had to turn off all bots completely, since the tests time out and
the next build starts while the previous one is still running at 100%
CPU, so the stuck processes accumulate and kill the board.

cheers,
--renato

Daniel_Sanders · December 5, 2014, 11:10pm

I'm not sure if it helps but the clang builder for mips has been dying of timeouts since http://lab.llvm.org:8011/builders/clang-cmake-mips/builds/389 which only has one commit in the blamelist (r223478 - LLVMContext: Store APInt/APFloat directly into the ConstantInt/FP DenseMaps.). I see the same commit in the build you linked to.

majnemer · December 5, 2014, 11:49pm

r223478 seems to be responsible for PR21770: http://llvm.org/bugs/show_bug.cgi?id=21770

d0k · December 6, 2014, 12:03am

> r223478 seems to be responsible for PR21770: 21770 – revision 223478 breaks Linux/i686 build

Reverted for now. Not sure what's going on there. Sorry for the breakage.

- Ben

Renato_Golin1 · December 6, 2014, 12:42am

No worries, at least that was easy to spot. Huzzah for buildbots!

cheers,
--renato

Daniel_Sanders · December 6, 2014, 11:45am

clang-cmake-mips didn't turn green immediately after the revert. I just checked on the machine and it appears that this was because of a large number of leftover processes from the build that timed out. After rebooting, it's turned green.

I'm curious about the reason these processes didn't die when buildbot timed out the build. Does lit clean up its subprocesses when it's killed?

> No worries, at least that was easy to spot. Huzzah for buildbots!

Yep, these things happen sometimes. Also, it's given me the perfect example of why continuous builds are better than the nightly buildbots we use internally at the moment. Once we're past the 3.5.1 release, I really need to push our upgrades along.

Renato_Golin1 · December 6, 2014, 12:44pm

The same happened to our bots. It’s because the processes don’t die, so buildbot thinks the build is dead and starts a new cycle while the old one is still running.

I had to stop the bot, kill all remaining processes and restart the bot. That did it.

Maybe we need an extra step on the bots to make sure the previous instance is not still running, or a signal to kill it when the master gives up on a timeout.
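
A minimal sketch of the second idea, assuming a POSIX bot and Python 3.3 or later. It is not how buildbot or lit actually handle timeouts; `run_with_group_timeout` and `_signal_group` are hypothetical names used only for illustration. The step is started in its own session, so every subprocess it spawns shares one process group, and on timeout the whole group is signalled rather than just the top-level process.

```python
import os
import signal
import subprocess


def _signal_group(pgid, sig):
    """Send sig to every process in the group, ignoring an empty group."""
    try:
        os.killpg(pgid, sig)
    except ProcessLookupError:
        pass  # nothing left in the group


def run_with_group_timeout(cmd, timeout_s, grace_s=5.0):
    """Run cmd in its own session; on timeout, kill its whole process group.

    Returns the exit code, or None if the command timed out.
    """
    # start_new_session=True makes the child call setsid(), so it leads a
    # fresh process group that its own children inherit by default.
    proc = subprocess.Popen(cmd, start_new_session=True)
    try:
        return proc.wait(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        pgid = proc.pid  # a session leader's group id equals its pid
        _signal_group(pgid, signal.SIGTERM)  # polite request first
        try:
            proc.wait(timeout=grace_s)
        except subprocess.TimeoutExpired:
            pass
        _signal_group(pgid, signal.SIGKILL)  # then force whatever is left
        proc.wait()  # reap the top-level process
        return None
```

A bot step could then call something like `run_with_group_timeout(["ninja", "check-all"], timeout_s=1200)` (command and timeout are placeholders). A child that moves itself into a different session or process group would escape this, so a pre-build check for leftover processes could still be useful.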

Cheers,
Renato
