Soundness of Proof Rules

Using a proof rule to verify a HeyVL program can introduce approximations compared to the original program semantics. These approximations influence guarantees Caesar gives during verification:

Sound verification: verification succeeds $\implies$ property holds for original program.
Sound refutation: verification fails $\implies$ property does not hold for original program (counterexample is valid).

Without sound verification, a property may not hold for the original program (unsound verification). On the other hand, without sound refutations, a counter-example may be spurious (not valid in the original program). However, it still is a counter-example to verification, which can be used to e.g. improve loop invariants.

Most program verifiers aim for sound verifications, but Caesar also supports sound refutations. This is useful when you want to show that a certain expectation is not a lower or upper bound.

This guide gives an overview of Caesar's proof rules and how they affect soundness of verifications and refutations. Caesar gives diagnostics when a verification or refutation might be unsound because of an incompatible proof-rule/calculus setup. For an overview of what Caesar does and does not check automatically, see What Is Checked By Caesar.

Quick Overview

To verify or refute a lower/upper bound (proc/coproc), the proof rules must induce the right approximation (Over/Under) between original semantics ( $orig$ ) and verification condition semantics ( $vc$ ). The cards below are the quick reference; the detailed rule mapping appears later in Proof Rule Approximations.

Quick GuidePick approximation by goal and procedure kind.

Verification · proc(lower bounds)

Approximation: Under

LFP (@wp, @ert)

@unroll, @omega_invariant, @ost, @ast

GFP (@wlp)

calls, @invariant, @k_induction

Verification · coproc(upper bounds)

Approximation: Over

LFP (@wp, @ert)

calls, @invariant, @k_induction, @past

GFP (@wlp)

@unroll, @omega_invariant

Refutation · proc(lower bounds)

Approximation: Over

LFP (@wp, @ert)

calls, @invariant, @k_induction, @past

GFP (@wlp)

@unroll, @omega_invariant

Refutation · coproc(upper bounds)

Approximation: Under

LFP (@wp, @ert)

@unroll, @omega_invariant, @ost, @ast

GFP (@wlp)

calls, @invariant, @k_induction

Example: Sound and Unsound Proofs and Refutations

All four examples share the same geometric loop and @invariant annotation; only kind (proc/coproc) and pre differ, so each one is an Over approximation of the original loop semantics.

Sound Proof
Sound Refutation
Unsound Proof
Unsound Refutation

Caesar reports sound verification: We show that init_x + 2 is an upper bound (coproc) on the expected value of x on termination (@wp). The actual expected value is init_x + 1.

@wp coproc sound_proof(init_x: UInt) -> (x: UInt)
    pre init_x + 2
    post x
{
    x = init_x
    var cont: Bool = true
    @invariant(ite(cont, x + 1, x))
    while cont {
        cont = flip(0.5)
        if cont {x = x + 1 } else { }
    }
}

Caesar reports a counter-example to the property: We refute that init_x + 2 is a lower bound (proc) on the expected value of x on termination (@wp). The actual expected value is init_x + 1.

@wp proc sound_refutation(init_x: UInt) -> (x: UInt)
    pre init_x + 2
    post x
{
    x = init_x
    var cont: Bool = true
    @invariant(ite(cont, x + 1, x))
    while cont {
        cont = flip(0.5)
        if cont { x = x + 1 } else {}
    }
}

Caesar reports unsound verification: We want to verify that init_x + 1 is a lower bound (proc) on the expected value of x on termination (@wp), but Park induction does not give us this guarantee.

@wp proc unsound_proof(init_x: UInt) -> (x: UInt)
    pre init_x + 1
    post x
{
    x = init_x
    var cont: Bool = true
    @invariant(ite(cont, x + 1, x))
    while cont {
        cont = flip(0.5)
        if cont { x = x + 1 } else {}
    }
}

Caesar reports a counter-example to verification: it is a counterexample against init_x + 1 as an upper bound (coproc) on the expected value of x on termination (@wp). But this is only because the @invariant(1) is not inductive (as Caesar reports), not because the specification does not hold.

@wp coproc unsound_refutation(init_x: UInt) -> (x: UInt)
    pre init_x + 1
    post x
{
    x = init_x
    var cont: Bool = true
    @invariant(1)
    while cont {
        cont = flip(0.5)
        if cont { x = x + 1 } else {}
    }
}

Original Program Semantics

The original semantics ( $orig$ ) is the semantics of the high-level program, which may feature constructs such as loops and recursion. During verification, we want to obtain sound results with respect to the original semantics. There are different ways to define the original semantics, but here we are mainly concerned with how they differ in their treatment of nonterminating loop and recursion runs.

A Zoo of Original Semantics

We distinguish:

Least Fixed Point (LFP) Semantics: while loops and recursive calls are interpreted via least fixed points.
- This is used in calculi such as $wp$ and $ert$ .
- In short: "nonterminating runs contribute post $0$ to the expected value".
Greatest Fixed Point (GFP) Semantics: while loops and recursive calls are interpreted via greatest fixed points.
- This is used in calculi such as $wlp$ .
- In short: "nonterminating runs contribute post $1$ to the expected value".

Calculus Annotations

Caesar supports procedure annotations to make the intended calculus explicit:

@wp: weakest pre-expectation calculus (least fixed points, nontermination contributes 0).
@wlp: weakest liberal pre-expectation calculus (greatest fixed points, nontermination contributes 1).
@ert: expected runtime calculus (least fixed points).

These annotations let Caesar check additional soundness conditions for proof-rule usage.

How Caesar Selects Original Semantics

If a calculus annotation (@wp, @wlp, or @ert) is present on a (co)proc:
- LFP for @wp and @ert,
- GFP for @wlp.
Otherwise selected by the proof rule so that verifications are sound (see Proof Rule Approximations).
- E.g. for Induction, GFP semantics are used for procs and LFP semantics for coprocs.
- E.g. for Loop Unrolling, LFP semantics are used for procs and GFP semantics for coprocs.

Approximations, Proofs, and Refutations

Correct results — sound verification or sound refutation — depend on how we approximate the original semantics ( $orig$ ) by the verification condition semantics ( $vc$ ). The verification condition semantics is the semantics of the verification conditions that Caesar generates and reasons about.

Kinds of Approximations

Arrows indicate one approximation being stronger than the other.

We distinguish four kinds of approximations between the original semantics ( $orig$ ) and the verification condition semantics ( $vc$ ). Let $S$ be a HeyVL statement and $X$ be a post-expectation.

Exact: No approximation is performed.
Formally: $orig[S](X) = vc[S](X)$ for all $X$ .
Under: The pre-expectation is approximated from below.
Formally: $orig[S](X) \geq vc[S](X)$ for all $X$ .
Over: The pre-expectation is approximated from above.
Formally: $vc[S](X) \geq orig[S](X)$ for all $X$ .
Unknown: None of the above holds.

Approximations are compositional¹. For example, the sequential composition $S_1;~ S_2$ of two Exact statements $S_1, S_2$ is also Exact. Same for Over and Under. Combining Exact with either Over or Under yields the respective approximation. However, combining Over and Under statements results in an Unknown approximation.

Sound Verifications

When Caesar says that a (co)proc is verified, then a bound on verification condition semantics ( $vc$ ) has been established.

In a proc, Caesar reasons about lower bounds $pre$ of the verification condition semantics ( $vc$ ), i.e. $pre \leq vc[S](post)$ .

If $vc[S](post)$ is an Under approximation of the original semantics ( $orig$ ), then we have sound verifications:

\texttt{pre} \;\leq\; vc[S](post) \;\leq\; orig[S](post).

Dually, in a coproc, Caesar reasons about upper bounds $pre$ of the verification condition semantics ( $vc$ ), i.e. $pre \geq vc[S](post)$ .

If $vc[S](post)$ is an Over approximation of the original semantics ( $orig$ ), then we have sound verifications:

\texttt{pre} \;\geq\; vc[S](post) \;\geq\; orig[S](post).

The results also apply for Exact approximations.

Sound Refutations

When Caesar gives a counterexample for a (co)proc, then a bound on verification condition semantics ( $vc$ ) has been refuted.

In a proc, a refutation means showing that $pre \nleq vc[S](post)$ .

If $vc[S](post)$ is an Over approximation of the original semantics ( $orig$ ), then we have sound refutations because $pre$ cannot be a lower bound of $orig[S](post)$ :

\texttt{pre} \;\nleq\; vc[S](post) \;\land\; vc[S](post) \;\geq\; orig[S](post) \;\Rightarrow\; \texttt{pre} \;\nleq\; orig[S](post).

Dually, in a coproc, a refutation means showing that $pre \ngeq vc[S](post)$ .

If $vc[S](post)$ is an Under approximation of the original semantics ( $orig$ ), then we have sound refutations because $pre$ cannot be an upper bound of $orig[S](post)$ :

\texttt{pre} \;\ngeq\; vc[S](post) \;\land\; vc[S](post) \;\leq\; orig[S](post) \;\Rightarrow\; \texttt{pre} \;\ngeq\; orig[S](post).

The results also apply for Exact approximations.

Formal Proof: Sound Refutations in procs

Let a proc not verify ( $pre \nleq vc[S](post)$ ) and let $vc[S](post)$ be an Over approximation ( $vc[S](post) \geq orig[S](post)$ ). For the sake of a contradiction, assume $pre \leq orig[S](post)$ .

By transitivity with $orig[S](post) \leq vc[S](post)$ , we get $pre \;\leq\; orig[S](post) \;\leq\; vc[S](post)$ , hence $pre \leq vc[S](post)$ , contradicting $pre \nleq vc[S](post)$ . Therefore $pre \nleq orig[S](post)$ .

Proof Rule Approximations

As explained above, proof rules such as @invariant may introduce approximations between the original semantics ( $orig$ ) and the verification condition semantics ( $vc$ ). Below, we summarize which of Caesar's built-in proof rules induce which approximations, and thus which proof rules are applicable for sound verification/refutation of lower/upper bounds.

Original Semantics	Approximation	Applicable Proof Rules
LFP	Over	Procedure Calls (k)-Induction (`@invariant`, `@k_induction`) Positive Almost-Sure Termination Rule (`@past`)
LFP	Under	Loop Unrolling (`@unroll`) ω-invariants (`@omega_invariant`) Optional Stopping Theorem (`@ost`) Almost-Sure Termination Rule (`@ast`)
GFP	Over	Loop Unrolling (`@unroll`) ω-invariants (`@omega_invariant`)
GFP	Under	Procedure Calls (k)-Induction (`@invariant`, `@k_induction`)

For loops, the loop-body approximation must be compatible with the chosen rule. For example, for k-Induction under LFP, the loop body must be Over so the whole loop is Over. Additionally, the rules Almost-Sure Termination Rule, Positive Almost-Sure Termination Rule, and Optional Stopping Theorem require the loop body to be Exact.

What Is Checked By Caesar

HeyVL is designed as an intermediate verification language and intentionally allows dangerous constructs. See our OOPSLA '23 paper for more background.

Caesar checks many proof-rule soundness conditions automatically, but not all modeling assumptions.

Hard errors:

In calculus-annotated procedures, calling a procedure with a conflicting calculus annotation is rejected.
Potentially recursive calls are rejected where Park induction is not sound (@wp proc, @wlp coproc, @ert proc).

Diagnostics during verification:

Caesar tracks approximation information per procedure, errors when a proof result may be unsound, and marks counterexamples as potentially spurious when they may not be valid.

Not checked:

Contradictions make verification trivially succeed — e.g., assume ?(false) in a proc; contradictory axioms are a common source.
There is no enforcement that @ert procedures contain tick statements, nor that @wp/@wlp procedures do not.

Compositionality of approximations follows from the fact that the semantics of HeyVL's statements is monotonic with respect to the expectation ordering. The only exception are non-monotonic negation statements, which always yield an Unknown approximation. ↩

Quick Overview​

Original Program Semantics​

A Zoo of Original Semantics​

Calculus Annotations​

How Caesar Selects Original Semantics​

Approximations, Proofs, and Refutations​

Kinds of Approximations​

Sound Verifications​

Sound Refutations​

Proof Rule Approximations​

What Is Checked By Caesar​

Footnotes​

Quick Overview

Original Program Semantics

A Zoo of Original Semantics

Calculus Annotations

How Caesar Selects Original Semantics

Approximations, Proofs, and Refutations

Kinds of Approximations

Sound Verifications

Sound Refutations

Proof Rule Approximations

What Is Checked By Caesar

Footnotes