RFC: Updates to developer policy around C++ standards bump

tobiashieta · August 10, 2022, 3:34pm

Hi everyone!

With C++17 merged to main the process caused a bit of problems with bots and peoples local development environment. From those problems there was a long and interesting discussion in Phabricator here: D130689 [LLVM] Update C++ standard to 17

With that background let’s look at the current policy: LLVM Developer Policy — LLVM 16.0.0git documentation

This section doesn’t really address what happens after we have bumped the toolchain requirement and doesn’t address updating the C++ standard at all. It’s obvious we can do better here - so I suggest we add new section to address C++ standard upgrade:

When the toolchain requirement have become a hard requirement in main we can now suggest to bump the C++ standard to the latest supported by the new toolchain requirement. In order to land this we should follow the procedure below.

Suggest the change as a RFC on discourse.
– This RFC should encourage bot owners and other vendors to test this new configuration by running a build with the cmake argument -DCMAKE_CXX_STANDARD=<newstandard> and have them report issues.

Once the RFC reaches consensus post a diff to Phabricator that changes the C++ standard in the CMake files. Here is an example. Link this diff from the discourse post.

When the diff is accepted by a wide range of contributors the post on discourse should be updated and suggest that it will be merged the following weekend. We want to try to merge this on a weekend to minimize the potential disruptions. This post also needs to point out that the change can be reverted so no new features should be used until further notice.

Keep an eye on the bots when the change is rolling out. If there seems to be many bots failing the change should be reverted and fixes should be solicitated.

After a week from the merge it’s now safe to use new features and the discourse post should be updated to reflect this.

This is my first draft based on the feedback in Phabricator - please discuss if this seems like a reasonable plan or if we should tweak it any way.

cor3ntin · August 10, 2022, 3:54pm

Thanks!
I like it, i think the last bullet could be more explicit ie

Patches using new languages features should not be merged until the language mode change is working on all bots and users have had sufficient time to test it (1-2 weeks).

(Sure, things might start using C++N+1 accidentally but this avoid sweeping/automated changes that are hard to rollback)

AaronBallman · August 10, 2022, 4:09pm

Thank you for putting this together! I definitely agree with the direction you’re heading with this.

I think we might want a more firm statement regarding the build lab after the change has been posted. There are some platforms for which we have very little testing unfortunately, so we can run into a situation where most of the build lab is green but it turns out that we’ve lost all testing for something that matters. As a concrete example, we don’t have very many Windows MSVC bots, so it’s actually not that hard to overlook when all three of them are red but everything else is green. That said, I’m not certain how strong of a statement we want to make here.

JDevlieghere · August 10, 2022, 5:16pm

Thanks for doing this, Tobias.

Regarding your first point, to encourage folks to preflight the new language standard, I was thinking we could introduce a CMake warning (for developers, similar to the soft toolchain requirements) that says “hey we’re moving to this new standard, please try it with DCMAKE_CXX_STANDARD and if you do this warning this will go away”. Hopefully getting rid of the warning is enough motivation for folks to give it a try.

AaronBallman · August 11, 2022, 11:05am

Another thought I had is we should probably make it an explicit task that someone follow up with the owners of failing bots after the change has been pushed when we discover bots aren’t meeting the new build requirements. For example, these bots all seem to be down because of the switch (there may be others):

tobiashieta · August 11, 2022, 11:32am

Yeah this is probably a good idea. I did this when we introduced the soft error for toolchain requirements. I wish we had a better way to surface things to bot owners in general - it’s a bit time-consuming and daunting to manually email the bot owners and to figure out who would be responsible for that. I do worry a bit that these procedures becomes very overwhelming to the point where a single contributor will not be willing to start working on it.

AaronBallman · August 11, 2022, 11:53am

FWIW, I share that concern but not very strongly. I don’t want to see us add needless process that is just busywork for people. But a lot of these suggestions are along the lines of ensuring we maintain a high quality product without being unduly disruptive to contributors and downstreams. I think that the amount of effort involved in that sort of work is what it is. We should not skimp on quality or allow disruption just to make it easy to change the default language mode – we only do that once every half decade or so, so the process being heavy isn’t a particularly worrying concern, but needless overhead is (if that makes sense).

DavidSpickett · August 11, 2022, 12:56pm

Just speaking for Linaro, we manage the two Windows on Arm bots you linked and we were aware of the switch from the updates on the original RFC. A message to us would be welcome but not necessary, we should be reacting as it happens (though I admit we have been slow in this case).

AaronBallman · August 11, 2022, 1:13pm

Thanks for that perspective (and thank you for the bots, btw)!

tobiashieta · August 11, 2022, 2:00pm

I think that’s fair.

I will give it a shot to update the initial draft tomorrow with everyone’s feedback incorporated.

mehdi_amini · August 11, 2022, 2:42pm

Well that logic seems to apply in the other direction equally well, doesn’t it?
“We only do that once every half decade or so, so the process incurring disruption for bot owners and downstream who aren’t following closely enough Discourse isn’t a particularly worrying concern”.

AaronBallman · August 11, 2022, 3:04pm

I guess I view it differently. We have quite a few bots, but we don’t have a lot of redundancy in our bots. So when we lose “enough” of one particular configuration, it’s easy for us to basically have lost almost all testing for that situation. If that happens for a day or two, it’s not a big deal (that happens as a natural course of us working on things, though we try to avoid it), but if it goes on for a significant amount of time the risk of serious disruption continues to increase. Sometimes that disruption may really only impact a downstream, but other times that disruption might be “there’s a new crash that only happens on Windows and there’s 1000 patches that could be the culprit, so who is responsible for fixing this”. Once a bot is red, any subsequent unrelated failures are very easy to miss.

Also, some bot owners are more likely to be in IT functions (managing servers, doing upgrades, etc) rather than development functions (following our Discourse, looking at bot status, etc). So I think a downstream not paying attention is a bit different than a bot owner not paying attention.

davidbolvansky · August 11, 2022, 3:38pm

Do we have any bots which uses the oldest supported compiler versions?

To ensure that what LLVM “promises” really works, in this case LLVM is always buildable with GCC 7.1, Clang 5.0 and MSVC 16.7. Atleast one with GCC 7.1 would be useful to catch issues like eg. Cannot compile with gcc < 8.1 · Issue #57057 · llvm/llvm-project · GitHub

mehdi_amini · August 11, 2022, 3:54pm

I used to have a bot on gcc 5.2 but we moved to gcc 8 right now, we could downgrade to gcc 7.1 if needed. However none of this is a guarantee about our support: for example we hit many bugs in gcc 7 when the gcc 5 bot was perfectly happy.
To me the minimum supported version is mostly useful as the line at which you don’t revert or accept patches to workaround compiler bugs anymore.

tschuett · August 11, 2022, 4:14pm

The most annoying thing of the upgrade were deprecated features in C++17 that caused a lot of warnings. Even without starting to use the new features.

reinterpretcast · August 11, 2022, 4:30pm

That’s a good point: Just switching the standard level may force code changes to unbreak bots with deprecation-as-warnings. The replacement functionality might be only in the new standard…

tschuett · August 11, 2022, 4:53pm

There is a deprecated std::iterator still in this file:

github.com

llvm/llvm-project/blob/main/bolt/include/bolt/Core/BinaryContext.h

//===- bolt/Core/BinaryContext.h - Low-level context ------------*- C++ -*-===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//
//
// Context for processing binary executable/library files.
//
//===----------------------------------------------------------------------===//

#ifndef BOLT_CORE_BINARY_CONTEXT_H
#define BOLT_CORE_BINARY_CONTEXT_H

#include "bolt/Core/BinaryData.h"
#include "bolt/Core/BinarySection.h"
#include "bolt/Core/DebugData.h"
#include "bolt/Core/JumpTable.h"
#include "bolt/Core/MCPlusBuilder.h"

This file has been truncated. show original

dyung · August 11, 2022, 6:53pm

As a result of this, I was thinking it could possibly be useful to have a small email list of bot maintainers, both public and private. That way changes like this could be proactively sent out to people who would be dealing with the fallout and help to get issues flushed out beforehand.

tobiashieta · August 11, 2022, 6:55pm

Yeah the discourse way to do this is to have all bot owners “follow” a category, then we can post bot specific stuff to that category.

tschuett · August 11, 2022, 8:44pm

After the soft error, we should encourage some/random people to smoke test the new standard to prepare the codebase and eliminate deprecated features early, i.e., before they reach the bots.

Topic		Replies	Views
[RFC] Toolchain update policy (migrating LLVM past C++11) LLVM Dev List Archives	10	117	January 18, 2019
[RFC] Developer Policy for LLVM C API LLVM Dev List Archives	87	326	September 15, 2015
RFC: Increasing the GCC and Clang requirements to support C++17 in LLVM LLVM Project	40	6336	April 2, 2022
[RFC] Proposed Policy for committing to LLVM-libc C	15	622	February 6, 2024
LLVM Weekly - #450, August 15th 2022 Newsletters llvm-weekly	0	508	August 15, 2022

RFC: Updates to developer policy around C++ standards bump

Related topics