The problem is that this *does* break existing code.
While I initially flagged the problem when I hit ambiguities with '__fp16', that was simply because every 6 months we align our out-of-tree implementation with the forthcoming formal numbered branch. In this case I was updating from v3.7.1 to v3.8.
It is our intention to start tracking head, but we're not there yet, or I would have caught this a lot earlier in its development.
When I am stabilising after one of these big-bang updates, I first address all the compile-time regressions in our test-suites. The '__fp16' ambiguity simply revealed to me that the ISO C names were now overloaded at the global namespace, even though '__fp16' is not meaningful to ISO C++.
But since then, I have been debugging runtime regressions, and these are far more difficult to debug.
A lot of the real-world application of our processor is math intensive, so the math libraries get exercised heavily. But performance is also critical, and programmers trade off between 'float' and 'double' quite often for a variety of reasons. Many of these math applications started off as FORTRAN and were then carefully ported and tuned to C. It is with great difficulty that we try to get programmers to use C++, and when they do, it is usually so that they can get better abstraction for their existing code (encapsulation and template collections mostly).
The main reasons that programmers chose between 'float' and 'double' (assume 'float' is IEEE FP32 and 'double' is IEEE FP64) are:
o Precision - when the problem needs to be more precise, they use 'double'
o Dynamic Range - when the numbers are very large, they use 'double'
o Space - large data sets take up a lot of memory, and programmers often
choose to compromise precision for space by using 'float'
o Performance - on systems where FP32 arithmetic is more performant than
FP64 arithmetic, programmers will sometimes sacrifice the precision of
'double' for the performance of 'float'
I don't see a lot of programmers using FP128 'long double'. I am not sure why, but I guess the loss of performance versus the gain in precision and dynamic range is not a viable trade-off for their programs. We also have extensive use of FP16, where precision and dynamic range take a back seat to raw "good enough" performance, and probably space too.
But I will illustrate the kind of real-world code (RWC) that is broken by overloading the ISO C names in the global namespace. The following is a reduction of a real example that regressed at runtime with this change; it is from a modelling library and is optimised for FP32. I have culled the actual code and left just the control-flow logic:
// Inputs: float inputA; float inputB;
double upperRange = pow(inputA, inputB);
if (isnan(upperRange))
  handleNanErrors(inputA, inputB);
else if (isinf(upperRange))
  handleOutOfRangeErrors(inputA, inputB);
else if ((upperRange <= __FLT_MAX__) && (upperRange >= __FLT_MIN__))
  useFP32OptimisedModel(upperRange, inputA, inputB);
else
  useHighDynamicRangeModel(upperRange, inputA, inputB);
The observed problem is that the "useHighDynamicRangeModel" implementation was never being executed, and instead "handleOutOfRangeErrors" was being called far more often than previously.
When I analysed this, the problem came down to the overloading of '::pow'. With ISO C, the function 'pow' takes two 'double' arguments and returns a value of type 'double'. The two input operands are 'float', so they are first promoted to 'double', and then 'pow(double, double)' is called, which yields a value of type 'double'.
With the overloading at the global namespace, the function '::pow(float,float)' is called, which in turn calls '::powf(float,float)' yielding a 'float' value which is then promoted to 'double' to initialise 'upperRange'.
But 'float' does not have the dynamic range of 'double', and many pairs of input operands of type 'float' yield results that exceed the dynamic range of 'float', so 'pow(float,float)' returns INFINITY, which is retained on promotion to 'double'. This in turn makes 'isinf(upperRange)' evaluate to true when the value exceeds the dynamic range of FP32, and not when it exceeds the dynamic range of FP64 as was intended and expected by the program.
The original code was compliant ISO C that was then migrated to C++, but as with the majority of legacy C programs, the changes made to operate with C++ were very minimal.
You could say "just put a cast to 'double' before the input operands" - and this will of course work in this case. But it took me a few hours to find this problem, buried as it was in the actual code, and of course I was blaming our target lowering as the probable cause until I realised that it was not a code-generation bug. And I expect that related problems are going to be liberally strewn throughout legacy C code that is migrated or partially migrated to C++, in ways that will be very difficult to detect.
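For what it's worth, the cast workaround applied to the reduced example would look like this ('computeUpperRange' is a hypothetical wrapper name for illustration, not from the original code):

```cpp
#include <cmath>

// Restore the ISO C behaviour by promoting explicitly, so that
// overload resolution can only select pow(double, double).
double computeUpperRange(float inputA, float inputB) {
    return std::pow(static_cast<double>(inputA),
                    static_cast<double>(inputB));
}
```

This keeps the computation in FP64, so 'isinf' once again reports overflow of the FP64 range rather than the FP32 range - but finding every affected call site in a large legacy code base is the hard part.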
In my opinion, this was never the intent of the C++ Standard Committee, and I think that it is also an undesirable interpretation of the words in the Standard that will only serve to further alienate C programmers from using C++; and this is already a difficult task in the embedded programming space, where the majority of C programmers already hear enough FUD about C++ and performance.
I am attaching a patch at this stage because I think that this is a significant enough issue for LibC++ v3.8 that it warrants urgent attention. I am also aware that it will probably not get resolved unless somebody does some work to make it happen, and I have done some of that work, which may serve as a starting point for others (the LibC++ maintainers) to complete. I am not experiencing failures in the LibC++ test-suite from these changes, with the exception of the 'depr.c.headers' test that I already mentioned, and I have also not seen any failures in modules.
And these changes have addressed the regressions in the RWC that I use for verification.
While I admit the changes are not the ideal solution, I think that they represent an adequate, minimalistic change to the sources that will offset the majority of the problems. I am already working on a rewrite of '<cmath>'/'<math.h>' that I think will produce a better solution, one that addresses the objectives of C++ while retaining the intentions of ISO C compatibility - and of course the other ISO C wrapper headers too. But I won't have this done in time for the v3.8 code freeze.
All the best,
MartinO