Formal Verification of Aave V3 Liquidation Boundaries: Worst-Case Bad Debt Thresholds

claudec · March 21, 2026, 1:33pm

Formal Verification of Aave V3 Liquidation Boundaries: Worst-Case Bad Debt Thresholds

Summary

I built Verifi, an open-source tool that uses Z3 theorem proving to compute the exact price shock at which bad debt becomes logically possible in Aave V3 markets. Not probable, but possible under any worst-case combination of oracle staleness, liquidator behavior, and position leverage.

Key finding: On Aave V3 Arbitrum, WETH bad debt becomes possible at approximately a 15% single-block price crash when a liquidator is present, or ~20% when absent. The worst historical single-block ETH drop was ~12%. This leaves a safety margin of approximately 3–4 percentage points against the binding failure mode.

To be direct about why this matters: Aave’s parameter sizing should not be consistent with bad debt at any shock magnitude that has historical precedent. A ~3–4pp margin means the protocol is one unusually large single-block shock away from a scenario where current parameters do not preclude bad debt. Not probabilistically, but as a consequence of the liquidation bonus arithmetic.

The methodology is complementary to simulation-based approaches. Where stochastic models (such as the conditional protocol equity framework recently published by Chaos Labs) answer “what parameter setting maximizes expected welfare?”, formal verification answers “below what shock threshold is bad debt provably impossible, regardless of probability distribution?”

Source code, tests, and full methodology: https://github.com/ClaudeCarlsson/verifi

Methodology

The verifier encodes Aave V3’s bad-debt conditions as constraints in the Z3 SMT solver. For each market, it constructs a symbolic state space (price, shock magnitude, supply, debt, oracle delay, liquidator presence, gas cost) subject to on-chain protocol parameters read live from Arbitrum contracts (PoolDataProvider, AaveOracle, Pool). It then asks Z3:

“Does there exist ANY assignment of these variables, within bounds, that results in bad debt?”

UNSAT → proven safe: no scenario within the bounded state space produces bad debt.
SAT → Z3 returns a concrete counterexample: exact price, exact supply/debt levels, exact oracle delay.

This is exhaustive proof over a continuous state space, not Monte Carlo sampling. Z3’s QF_NRA (quantifier-free nonlinear real arithmetic) theory is decidable for this constraint class.

A binary search (8 iterations, ~0.4pp precision) pinpoints the shock magnitude where UNSAT flips to SAT: the boundary.

A note on precision. All reported boundaries should be read as ±0.4pp. Two independent binary searches on the same parameters can converge to points up to ~0.8pp apart. Where this matters I report both values and use approximate language (“~15%”) rather than implying sub-percentage-point precision. A finer-grained search (16 iterations, ~0.2pp) is planned to tighten these ranges.

What the model captures

Single-block instantaneous price shocks
Chainlink oracle staleness (symbolic, bounded by heartbeat)
Oracle deviation thresholds
Two bad-debt failure modes: (1) liquidator present but bonus overconsumes collateral, (2) liquidator absent and position goes underwater
Supply and borrow anchored within ±10% of live on-chain values
Protocol parameters (LTV, liquidation threshold, bonus, caps) read directly from contracts

What the model does NOT capture

Multi-block cascades. The boundary assumes an atomic shock. Cascading liquidations across blocks, where liquidator collateral sales push prices further down, could breach this threshold through individually smaller drops. However, this cuts both ways: the model also does not capture stabilizing flows (arbitrageurs, new liquidators entering, borrower repayments) that dampen cascades in practice.
Binary liquidator. Real liquidator behavior is continuous: partial clears, gas competition, MEV dynamics, DEX liquidity constraints. The model uses True/False.
No cross-market contagion. Each market is verified independently. Correlated liquidations across WETH/WBTC/etc. sharing liquidity pools are not modeled.
Static supply/borrow. Dynamic position management during stress (withdrawals, repayments) is not captured.
Max-leverage worst case. The ~15% boundary assumes positions at max LTV (80%). The LTV sensitivity table below shows how the boundary changes at lower leverage.

An important framing point: even though a ~15% atomic single-block drop is a tail event, the question is not whether it is likely. The question is whether the protocol’s parameters are sufficient to preclude bad debt when it occurs. Simulation-based approaches can assess the probability and expected cost. Formal verification establishes the threshold below which bad debt is provably impossible. Both are needed.

Findings

On-chain data read live from Aave V3 Arbitrum via public RPCs. Prices, utilization, and supply/borrow levels change continuously. Run make analyze against the repo for current values.

Market Overview

17 markets scanned. Six non-stablecoin markets shown below (full scan output in the repo):

Asset         Price   Util  LiqThr  WithLiq   NoLiq   HistMargin
WETH         $2,157   85%   0.840    15.5%    20.1%     +3.5pp   THIN
WBTC        $70,697   10%   0.780    82.9%    84.5%    +70.9pp   WIDE
LINK             $9    2%   0.750    > 99%    > 99%     > 87pp   WIDE
ARB          $0.10     7%   0.630    > 99%    > 99%     > 87pp   WIDE
rETH         $2,502    2%   0.740    > 99%    > 99%     > 87pp   WIDE
weETH        $2,353    0%   0.770    > 99%    > 99%     > 87pp   WIDE

WithLiq = liquidator forced present. Bad debt occurs via bonus overconsumption: the liquidator seizes more collateral than exists after the price shock.
NoLiq = liquidator forced absent. Bad debt occurs via an uncleared underwater position.
HistMargin = distance between WithLiq and the worst historical single-block drop (~12%).

The binding failure mode for WETH is bonus overconsumption (WithLiq at 15.5%), not absent liquidators (NoLiq at 20.1%). The 5% liquidation bonus reduces the safety boundary by 4.6pp compared to a world where no liquidator acts at all, a direct quantification of the leakage-safety tradeoff.

WETH is the only “THIN” margin asset. This is consistent with its 85% utilization and 0.84 liquidation threshold leaving minimal buffer.

Counterexample

When Z3 finds the boundary, it returns a concrete scenario. The counterexample below is from the boundary search where Z3 is free to choose both liquidator presence and oracle delay. It selected liquidator present, confirming that bonus overconsumption is the binding path:

WETH at $1,047 drops 16.0% to $879
Oracle stale: 0s (of 3600s heartbeat)
Liquidator present: True
Collateral post-shock: $148,922,785
Debt: $141,857,480
Bad debt: $27,569

Cross-check: Seized by liquidator = $141,857,480 × 1.05 = $148,950,354. This exceeds post-shock collateral of $148,922,785 by $27,569. Bad debt confirmed.

Oracle staleness is 0s, meaning bad debt occurs even with a perfectly fresh oracle. The binding constraint is the liquidation bonus mechanics, not oracle delay.

LTV Sensitivity

The boundary depends sharply on leverage. This table shows the WithLiq (bonus overconsumption) boundary at each LTV level:

Asset       LTV 40%    LTV 50%    LTV 60%    LTV 70%    LTV 80%
WETH          > 99%      47.7%      37.0%      26.3%      15.5%

The analytical formula predicts d > 1 − λ(1 + β). At 80% LTV this gives 16.0%, consistent with the Z3 result of 15.5% within search precision. At 60% it gives 37.0%, an exact match.

At 40% LTV, the analytical formula predicts ~58%, but Z3 reports > 99%. This is because the Z3 model anchors supply and borrow near observed on-chain values (±10%). At 85% current utilization, the achievable debt-to-collateral ratio within the ±10% anchoring band at a hypothetical 40% LTV is far below what the formula assumes. The market would need to be operating near 40% LTV in practice for that analytical bound to be reachable. This is actually a conservative property: Z3 only flags risk where the on-chain state could plausibly support it. For governance, the 80% LTV row is the relevant one, since that is where max-leverage positions actually exist.

If most WETH positions are at 60% LTV rather than max, the effective boundary is 37%, a much wider margin. I am building an automated pipeline to pull position-level LTV distributions from the Aave subgraph to weight the boundary by actual leverage rather than worst-case max.

Liquidation Bonus Sensitivity

Higher bonus attracts liquidators faster but eats into collateral margin. This tradeoff is quantified:

Bonus  WithLiq   NoLiq   Gap
  3%    17.8%    20.9%  -3.1pp
  5%    14.8%    20.9%  -6.1pp  ← current
  7%    14.8%    20.9%  -6.1pp
 10%    11.7%    20.9%  -9.2pp
 15%     8.7%    20.9%  -12.2pp

NoLiq is invariant to bonus, as expected. If no liquidator acts, the bonus is irrelevant. The entire gap between WithLiq and NoLiq is attributable to the bonus.

A sharp-eyed reader will notice that the main table reports WithLiq = 15.5% at the current 5% bonus, while this table shows 14.8%. Both are correct outputs from independent binary search runs. The ~0.7pp difference is within the combined ±0.4pp precision of two 8-iteration searches converging on slightly different regions of the state space. Similarly, the 5% and 7% rows show the same WithLiq (14.8%), though the analytical formula predicts distinct values (16.0% and 14.4%). I treat the boundary as approximately 15% ± 1pp at the current 5% bonus, and plan to tighten this with a 16-iteration search (~0.2pp precision) in the next version.

Reducing the bonus from 5% to 3% recovers approximately 3pp of safety margin, moving WithLiq from ~15% to ~18%. This is a concrete, actionable parameter change that could be evaluated. I am not proposing a specific change here. The bonus also affects liquidator incentive strength, and the optimal tradeoff requires the kind of simulation-based welfare analysis that Chaos Labs’ equity uplift framework provides. But formal verification establishes the hard constraint: at the current 5% bonus, the system cannot preclude bad debt beyond approximately 15%; at 3%, beyond approximately 18%. Any simulation-derived optimal bonus should be evaluated against this floor.

Historical Context

Event                    Date         Daily Drop  Block Drop (est.)
May 2021 (China ban)     2021-05-19       34%        ~12%
Jun 2022 (3AC/Luna)      2022-06-13       21%        ~8%
Nov 2022 (FTX)           2022-11-09       17%        ~6%
Mar 2023 (SVB)           2023-03-10       10%        ~4%
Aug 2024 (Yen carry)     2024-08-05       22%        ~9%
Oct 2025 (Tariffs)       2025-10-11       20%        ~12%

Single-block drops are estimates derived from Chainlink oracle update patterns during crash windows, not direct block-by-block price reconstruction. I am building a pipeline to replay latestRoundData() block-by-block during these events for precise measurement. The ~12% figure and the ~3pp margin should be treated as approximate until that data is available.

The October 2025 tariff crash is particularly relevant: Chaos Labs’ post-mortem reported ETH dropping ~20% daily with SVR oracle delays of up to 5 blocks causing 30-40% price dislocations on some feeds. Aave concluded net positive ($1.5M profit on $180M liquidations, ~$500k bad debt), but the event demonstrates that large single-block shocks are not theoretical. They happen, and the margin between “net positive” and “bad debt” is thinner than it appears.

Relationship to Simulation-Based Approaches

The Chaos Labs liquidation parameter analysis optimizes the liquidation bonus by maximizing conditional protocol equity uplift:

ΔE = fD(1 − b(1−r))

This is an expected-value optimization over simulated GARCH price trajectories with explicit DEX liquidity modeling via a synthetic Uniswap V3 AMM. It is the right tool for choosing parameters that maximize protocol welfare under realistic conditions.

Formal verification asks a different question: what is the hard boundary below which no parameter setting can produce bad debt?

Feature	Simulation	Formal Verification
Question	What bonus maximizes expected equity uplift?	At what shock is bad debt provably impossible?
Method	GARCH price trajectories, synthetic AMM, Monte Carlo	Z3 SMT solver, exhaustive proof over continuous state space
Output	Optimal parameter + distributional statistics	Hard boundary + concrete counterexample
Strength	Captures realistic frictions (slippage, gas, MEV)	Guarantees no scenario is missed within model bounds
Limitation	Finite samples; tail events may be underrepresented	Single-block; no dynamic frictions

Data Sources and Reproducibility

All inputs are free, no API keys required. The tool reads directly from Aave V3 Arbitrum contracts via public RPCs:

Source	Contract	Data
PoolDataProvider	`0x69FA688f...`	LTV, liquidation threshold, bonus, caps
Pool	`0x794a6135...`	Total supply, total borrow, utilization
AaveOracle	`0xb56c2F0B...`	Current asset prices

Everything runs in Docker (make analyze ASSETS=WETH).

Appendix: Analytical Derivation

For readers interested in the mathematical foundation, the boundary has a clean analytical form at max leverage.

Bad debt with liquidator (bonus overconsumption):

The liquidator seizes collateral worth B × (1 + β). Bad debt occurs when this exceeds total post-shock collateral:

p(1-d) · S < B · (1 + β)

At max leverage, B = S · p · λ:

p(1-d) · S < S · p · λ · (1 + β)
(1-d) < λ(1 + β)
d > 1 − λ(1 + β)

For WETH (λ = 0.80, β = 0.05): d > 1 − 0.84 = 16.0%

Bad debt without liquidator:

No liquidation occurs. Bad debt when collateral < debt:

p(1-d) · S < S · p · λ
d > 1 − λ

For WETH: d > 1 − 0.80 = 20.0%

Z3 reports WithLiq in the range of 14.8%–15.5% across independent search runs, consistent with the analytical 16.0% within the ±0.4pp binary search precision plus state-space exploration effects (Z3 also searches over supply/borrow anchoring bands and oracle deviation, which the closed-form ignores).

Topic		Replies	Views
Optimal Liquidation Parameters, Conditional Value at Risk, and Equity Characterizations in Aave v3 and v4 Risk	2	336	March 16, 2026
Projection Finance — Free & open-source simulator to project your Aave position over time Risk	0	90	June 9, 2026
[ARFC] Upgrade PT Risk Oracle to Protocol-Owned Infrastructure on CRE Governance	1	198	June 27, 2026
[TEMP CHECK] Post-rsETH Collateral Framework: Tier-Based LTV Reductions and Wrap-Depth Ineligibility Limits Risk	16	978	June 16, 2026
[ARFC] Aave Risk Framework Risk	6	1538	June 18, 2026

Formal Verification of Aave V3 Liquidation Boundaries: Worst-Case Bad Debt Thresholds

Formal Verification of Aave V3 Liquidation Boundaries: Worst-Case Bad Debt Thresholds

Summary

Methodology

What the model captures

What the model does NOT capture

Findings

Market Overview

Counterexample

LTV Sensitivity

Liquidation Bonus Sensitivity

Historical Context

Relationship to Simulation-Based Approaches

Data Sources and Reproducibility

Appendix: Analytical Derivation

Related topics