Highly Incremental: A Simple Programmatic Approach for Many Objectives

March 4, 2026 · 7 min read

Caesar Developer

Our new paper "Highly Incremental: A Simple Programmatic Approach for Many Objectives" by Philipp Schröer and Joost-Pieter Katoen presents a one-fits-all approach for quantitative objectives on probabilistic programs, applicable to verification with Caesar.

The paper will be presented at The 27th International Symposium on Formal Methods (FM 2026) in Tokyo, Japan.

A preprint with a detailed appendix of proofs and case studies is available on arXiv: "Highly Incremental: A Simple Programmatic Approach for Many Objectives (Extended Version)".

In this post, we:

show how reward statements model runtime and related quantities.
explain why verification of variance is difficult for runtime objectives.
introduce our sound program transformation for these objectives, including higher moments, tail probabilities, CDFs, expected excess, and moment-generating functions.
work through a Caesar example for the second moment of runtime.

Programmatic Reward Modeling

The paper explores what can be encoded into probabilistic programs with reward statements, which collect a certain quantity — rewards — during execution. We then consider expected rewards, i.e. the expected value of rewards collected over the distribution of program executions. This approach enables more applications for Caesar's weakest pre-expectation reasoning out-of-the-box.

We call this technique programmatic reward modeling and start with simple examples. First, we observe that we do not need dedicated post-expectation specifications, but can encode rewards on termination by simply writing a reward statement at the end of the program. From there, it is simple to generalize to runtime modeling and the more general resource consumption modeling by adding reward statements in between program statements.

By using auxiliary variables that are not used in the concrete program's execution — called ghost variables —, we can reason about more objectives. Discounting (future rewards matter less), step-indexed (reward at time point $N$ ), and even the expected number of visits, first visit times, and first return times, can all be modeled this way too.

A Challenge: Higher Moments of Runtimes

Expected runtime alone is often not enough: two programs can have the same expected runtime but very different runtime variability. To capture that difference, we need higher moments: the second moment $\mathbb{E}(\tau^2)$ captures the expected value of the square of the runtime, and the variance is $\mathbb{E}(\tau^2) - \mathbb{E}(\tau)^2$ .

Consider the three different runtime encodings in the following figure.

(a) Termination reward (runtime)

var tau: UInt = 0
while b {
  tau = tau + 1
  S'
}
reward tau

(b) Termination reward (squared runtime)

var tau: UInt = 0
while b {
  tau = tau + 1
  S'
}
reward tau * tau

(c) Incremental reward (runtime)

while b {
  reward 1
  S'
}

Three runtime encodings with reward statements. (a) and (b) collect tau and tau * tau on termination, while (c) collects reward 1 incrementally at each loop iteration.

For runtime objectives, only (c) is sound for all programs. This becomes clear for b = true (i.e. while true), which has infinite runtime. This is correctly captured by (c), which has an infinite expected reward, while (a) and (b) have an expected reward of $0$ because the final reward statement is never reached.

However, we'd like to retain the simplicity of being able to transform reward models like (a) into (b) to analyze higher moments. This is the key motivation for our programmatic reward transformations, which we introduce next.

Programmatic Reward Transformations

We introduce programmatic reward transformations. Instead of inventing a separate reward model for each objective, we start from a sound incremental reward model and transform it systematically.

Given a program $P$ and a monotonic function $f$ between rewards, our transformation constructs a program $\mathcal{T}_f(P)$ whose expected reward is transformed by $f$ . For higher moments, we choose $f(\tau) = \tau^k$ , so an expected reward $\mathbb{E}(\tau)$ becomes $\mathbb{E}(\tau^k)$ . This directly captures quantities such as the second moment, which is useful for analyzing runtime variance.

The idea is shown in the following sketch:

(a) Original incremental reward model

while b {
  reward r
  S'
}

(b) Transformed model $\mathcal{T}_f(P)$

var tau: EUReal = 0
while b {
  reward f(tau + r) - f(tau)
  tau = tau + r
  S'
}

Sketch of the transformation from a base reward model $P$ to $\mathcal{T}_f(P)$ . A ghost variable tau tracks accumulated reward, and each transformed step adds the increment induced by $f$ . For $f(\tau)=\tau^2$ , this yields a sound encoding of second moments.

The transformation carefully constructs incremental rewards so that the result is correct even for programs that do not terminate with probability $1$ , in particular diverging programs that accumulate infinite rewards. We show soundness of the transformation formally in the paper.

We also show how our program transformation recovers other objectives: higher moments via $f(\tau) = \tau^k$ , threshold/tail probabilities via $f_N(\tau) = [\tau \ge N]$ , CDF values by complement (equivalently via $f_N(\tau) = [\tau \le N]$ ), expected excess via $f_N(\tau) = \tau \ominus N$ , and moment-generating functions via $f_\lambda(\tau) = \exp(\lambda \tau)$ .

Finally, we show how our transformation can be generalized to multi-reward settings, where the program keeps track of multiple rewards structures like runtimes and memory usage. This enables objectives like expected runtime-memory products, relevant for many cloud billing models.

Example with Caesar: Second Moment of Runtime

Consider a simple program that sends requests to a database. These requests can fail with some probability, in which case the program retries until it succeeds.

The following HeyVL program succeeds with probability $0.5$ on each try and models runtime with reward 1 per retry. Using the @invariant annotation and the pre-attribute on the coproc, we show that the expected runtime is at most $2$ .

coproc retry_expected_runtime() -> (done: Bool)
    pre 2
    post 0
{
    done = false
    @invariant([!done] * 2)
    while !done {
        reward 1
        done = flip(0.5)
    }
}

Contrast this with the following program that succeeds with probability $\frac{2}{3}$ , but incurs an additional constant cost of $0.5$ at startup. The expected value is also $2$ . So first moments cannot distinguish the two programs.

coproc retry2_expected_runtime() -> (done: Bool)
    pre 2
    post 0
{
    done = false
    reward 0.5
    @invariant([!done] * 1.5)
    while !done {
        reward 1
        done = flip(2/3)
    }
}

These programs have different runtime distributions. Applying our program transformation with $f(\tau) = \tau^2$ lets us analyze the second moment, $\mathbb{E}(\tau^2)$ .

We introduce a ghost variable tau to track the original runtime and increment the reward by the second-moment change at each step. This makes the transformed program's total reward equal to the original program's second moment, even for non-terminating executions.

Below is the transformed program for the first example. Its second moment is $6$ , and the per-step reward increment is $(\tau + 1)^2 - \tau^2 = 2 \cdot \tau + 1$ .

coproc retry_second_moment() -> (done: Bool)
    pre 6
    post 0
{
    done = false
    var tau: UInt = 0
    @invariant([!done] * (6 + 4 * tau))
    while !done {
        reward (tau + 1) * (tau + 1) - tau * tau
        tau = tau + 1
        done = flip(0.5)
    }
}

For the second example, the second moment is $4.75$ , so the runtime distribution is more concentrated around its mean.

coproc retry2_second_moment() -> (done: Bool)
    pre 4.75
    post 0
{
    done = false
    var tau: UReal = 0.5
    reward 0.25
    @invariant([!done] * (4.5 + 3 * (tau - 0.5)))
    while !done {
        reward (tau + 1) * (tau + 1) - tau * tau
        tau = tau + 1
        done = flip(2/3)
    }
}

Both programs have expected runtime $2$ , but their variances are $6 - 2^2 = 2$ and $4.75 - 2^2 = 0.75$ , so the second program is more predictable.

Programmatic Reward Modeling​

A Challenge: Higher Moments of Runtimes​