We are working on writing a paper about testing the reliability of C compilers by using Csmith (a random C99 program generator).
A previous testing effort using Csmith found 202 LLVM bugs, which represented about 2% of all bugs reported at the time (PDF: https://www.flux.utah.edu/download?uid=114). However, we are unaware of any further testing using Csmith since that paper was published, and we would like to ask whether you know of any such efforts or results.
Best regards,
Radu Ometita,
Functional compilers engineer @IOHK
Zhendong, who’s done a lot of work on automated testing of LLVM.
Just speaking for myself here, I use Csmith as part of my pre-commit testing.
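For readers unfamiliar with how Csmith is typically driven, the core loop is differential testing: generate a random C program, build it two ways, and compare the checksums the generated program prints. The sketch below is only illustrative, not anyone's actual pre-commit harness; it assumes `csmith` is on the PATH, that a `CSMITH_HOME` environment variable points at a checkout whose `include` directory holds the runtime headers, and it uses the system `cc` at two optimization levels rather than two distinct compilers.

```shell
#!/bin/sh
# Minimal Csmith differential-testing loop (illustrative sketch).
# Assumptions: csmith on PATH, CSMITH_HOME/include holds the runtime
# headers, and `cc` plus coreutils `timeout` are available.
set -u

if command -v csmith >/dev/null 2>&1; then
  for seed in 1 2 3 4 5; do
    csmith --seed "$seed" > test.c

    # Build the same program at -O0 and -O2; skip the seed if either
    # compile fails (e.g. missing runtime headers).
    cc -I"${CSMITH_HOME:-/usr/local}/include" -O0 -w test.c -o test_O0 || continue
    cc -I"${CSMITH_HOME:-/usr/local}/include" -O2 -w test.c -o test_O2 || continue

    # Csmith programs print a checksum of their global state; the
    # timeout guards against a rare non-terminating generated program.
    a=$(timeout 10 ./test_O0) || continue
    b=$(timeout 10 ./test_O2) || continue

    # A checksum mismatch is a candidate miscompilation at one of the
    # two optimization levels.
    [ "$a" = "$b" ] || echo "seed $seed: checksum mismatch ($a vs $b)"
  done
else
  echo "csmith not found; skipping"
fi
```

In practice such harnesses also reduce failing programs (e.g. with C-Reduce) before reporting, since raw Csmith output is far too large to file as a bug.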
There are also a couple of other active fuzzing efforts running concurrently:
OSS-Fuzz is continuously running a set of codegen and individual-pass fuzzers. A bit more information can be found here:
We (Azul Systems) have a continuously running Java fuzzer that exercises LLVM through our Falcon compiler; it regularly finds regressions and the occasional deep, long-lurking issue. We don’t have a public bug tracker for this, but a sizable portion of our upstream bug-fixing activity is driven by the output of this tool.
You may also be interested in the following resources on compiler correctness (articles, software, and talks, ranging from general topics to those specifically focused on testing, validation, and verification):