esbmc-vampire-project

This repo contains benchmarks and other data relating to the ongoing ESBMC-Vampire integration.

Description of Project

ESBMC works by bounded model checking ¹. As part of this process, it converts the input C program into GOTO format, then symbolically executes this GOTO program to produce Static Single Assignment (SSA) form. During symbolic execution, loops are normally unwound to some depth k. In this integration, we treat loops differently. To understand our approach, consider the following program:

int main() {
  // variable declarations
  int n = __VERIFIER_nondet_int();
  int v1 = __VERIFIER_nondet_int();
  int v2 = __VERIFIER_nondet_int();
  int v3 = __VERIFIER_nondet_int();
  int x;
  // pre-conditions
  x = 0;

  // candidate invariant, followed by the loop
  __invariant(x <= n || n < 0);
  while (x < n) {
    x = x + 1;
  }

  // post-condition
  if (x != n)
    __VERIFIER_assert(n < 0);
}

Note that we use a special function __invariant(...) to mark candidate invariants. These statements must appear on the line(s) before a loop.

When symbolic execution reaches the loop statement, it first attempts to prove that the candidate invariant holds on entry to the loop (base case). The loop variables are then havocked (set to non-deterministic values) and the loop body is symbolically executed a single time. Taking the resulting loop SSA and assuming the inductive hypothesis, we attempt to prove the inductive conclusion. If this succeeds, we have proven the candidate invariant to be a true invariant. The loop is replaced with the candidate invariant and symbolic execution continues as normal.

Note that we support multiple candidate invariants for a loop. For example, the following is valid:

int main() {
  __invariant(cand1);
  __invariant(cand2);
  __invariant(cand3); 
  while ((x < n)) {
    ...
  }
}

In such a case, the order in which we attempt to prove the candidates may affect what can be proved, since an already proven invariant can help in proving another. To deal with this, we make multiple proof passes through the candidates and stop when either all candidates have been proven or a pass proves no new invariant.

Proving Invariants

Currently, all attempts to prove an invariant are passed to the Vampire theorem prover. Nothing in the approach is specific to Vampire; any theorem prover could be used here. However, by using a superposition-based theorem prover, we are able to handle quantified invariants. To this end, a pair of special functions __forall(...) and __exists(...) are used. For example:

int main() {
  int n = 1000000;
  int a[n];
  int i = 0;
  int x;

  __invariant(n == 1000000);
  __invariant(0 <= i && i <= n);
  __invariant(__forall((void*)(&x), (!((x < i) && (x >= 0)) || a[x] == x)));
  while (i < n) {
    a[i] = i;
    i++;
  }

  __VERIFIER_assert(__exists((void*)(&x), (a[x] == 500)));
}

This is a very short-term solution. A better long-term solution would be to integrate a specification language such as ACSL ² into ESBMC.

Code

All modifications to ESBMC relating to this project can be found on the following branch:

https://github.com/esbmc/esbmc/tree/ahmed-vampire-for-loops

To build ESBMC, please follow the build instructions in the ESBMC repo. To run ESBMC with Vampire, the following command should be used:

<path to ESBMC> <path to benchmark> --vampire-for-loops --ir --output <output file name> --vampire-path <path to Vampire executable> --no-bounds-check

It is necessary to run with the option --ir since Vampire cannot currently handle bit vectors. It is also important to add the option --no-bounds-check, as otherwise ESBMC adds extra assertions, potentially within a loop unrolling. Currently, we cannot handle assertions within a loop (see the TODO list). The option --output specifies the name of a file to which SMT problems are written. At times it can be useful to add an abort() statement to ESBMC to stop execution at a particular point and inspect the content of this file. This allows us to analyse the data being passed from ESBMC to Vampire.

So far, we have been using the main branch of Vampire to carry out tests. Vampire is invoked via a system call as it does not have a working API currently.

What is in this Repo

Besides this README, this repo contains a set of benchmarks that I have been using to test the implementation. Some of these are handcrafted and many others come from:

https://github.com/PL-ML/code2inv/tree/master/benchmarks/C_instances/c

Useful Papers

The following is a list of papers that have been useful so far or may be useful in the future of this project. Please be aware that this selection is very limited and there are of course many other relevant papers. The fields of invariant generation and program verification are huge.

Code2Inv: A Deep Learning Framework for Program Verification
LEMUR: Integrating Large Language Models in Automated Program Verification. (I think that this paper is very important, and it gives me a lot of hope that the approach we are taking could be very competitive.)
Scalable and Precise Verification Based on k-induction
Loop Invariants, Analysis, Classification and Examples
Learning Shape Analysis
From Invariant Checking to Invariant Inference Using Randomized Search
Can ChatGPT Support CPA-Checker with ...
Learning Loop Invariants for Program Verification
Neural Termination Analysis

TODO List

The basic implementation now seems to be working reasonably well (albeit tested over a small set of relatively simple benchmarks). However, there are a number of tasks outstanding, of varying levels of importance:

Helpful Hints

The ESBMC code base is pretty large and it takes some time to become familiar with it. Here are some things that I wish I knew before I started work on the codebase:

ESBMC maintains two internal representations, irep and irep2. irep is the older of the two, based on strings, and actually the easier to use. irep2 is heavily based on templates and uses some fancy Boost macros to produce a lot of boilerplate code. To convert between the two there are various utility functions located in here. Both can be printed out using a dump() method. This produces a detailed AST for the expression. To convert the internal representation back to something that resembles C code, we can use this function or the one below it.
The log_debug(...) function only produces output if verbosity is set high enough. Thus, in some cases it makes sense to use log_status(...) or just a plain old std::cout << ... instead.
If anything in the above is unclear, I can always be contacted by email: ahmed_bhayat at hotmail.com.
