2021-03-09 10:34:55 +00:00
.. _formal_verification:
##################################
SMTChecker and Formal Verification
##################################
Using formal verification it is possible to perform an automated mathematical
proof that your source code fulfills a certain formal specification.
The specification is still formal (just as the source code), but usually much
simpler.
Note that formal verification itself can only help you understand the
difference between what you did (the specification) and how you did it
(the actual implementation). You still need to check whether the specification
is what you wanted and that you did not miss any unintended effects of it.
Solidity implements a formal verification approach based on
2021-07-06 15:24:37 +00:00
`SMT (Satisfiability Modulo Theories) <https://en.wikipedia.org/wiki/Satisfiability_modulo_theories> `_ and
2021-03-09 10:34:55 +00:00
`Horn <https://en.wikipedia.org/wiki/Horn-satisfiability> `_ solving.
The SMTChecker module automatically tries to prove that the code satisfies the
specification given by `` require `` and `` assert `` statements. That is, it considers
`` require `` statements as assumptions and tries to prove that the conditions
inside `` assert `` statements are always true. If an assertion failure is
found, a counterexample may be given to the user showing how the assertion can
be violated. If no warning is given by the SMTChecker for a property,
it means that the property is safe.
The other verification targets that the SMTChecker checks at compile time are:
- Arithmetic underflow and overflow.
- Division by zero.
- Trivial conditions and unreachable code.
- Popping an empty array.
2021-03-23 18:15:14 +00:00
- Out of bounds index access.
2021-03-09 10:34:55 +00:00
- Insufficient funds for a transfer.
2021-07-07 10:31:09 +00:00
All the targets above are automatically checked by default if all engines are
enabled, except underflow and overflow for Solidity >=0.8.7.
2021-03-09 10:34:55 +00:00
The potential warnings that the SMTChecker reports are:
- `` <failing property> happens here. `` . This means that the SMTChecker proved that a certain property fails. A counterexample may be given, however in complex situations it may also not show a counterexample. This result may also be a false positive in certain cases, when the SMT encoding adds abstractions for Solidity code that is either hard or impossible to express.
- `` <failing property> might happen here `` . This means that the solver could not prove either case within the given timeout. Since the result is unknown, the SMTChecker reports the potential failure for soundness. This may be solved by increasing the query timeout, but the problem might also simply be too hard for the engine to solve.
2021-03-31 15:20:30 +00:00
To enable the SMTChecker, you must select :ref: `which engine should run<smtchecker_engines>` ,
where the default is no engine. Selecting the engine enables the SMTChecker on all files.
.. note ::
2021-07-14 16:56:55 +00:00
Prior to Solidity 0.8.4, the default way to enable the SMTChecker was via
`` pragma experimental SMTChecker; `` and only the contracts containing the
pragma would be analyzed. That pragma has been deprecated, and although it
still enables the SMTChecker for backwards compatibility, it will be removed
in Solidity 0.9.0. Note also that now using the pragma even in a single file
enables the SMTChecker for all files.
2021-03-09 10:34:55 +00:00
.. note ::
2021-07-14 16:56:55 +00:00
The lack of warnings for a verification target represents an undisputed
mathematical proof of correctness, assuming no bugs in the SMTChecker and
the underlying solver. Keep in mind that these problems are
*very hard* and sometimes *impossible* to solve automatically in the
general case. Therefore, several properties might not be solved or might
lead to false positives for large contracts. Every proven property should
be seen as an important achievement. For advanced users, see :ref: `SMTChecker Tuning <smtchecker_options>`
to learn a few options that might help proving more complex
properties.
2021-03-09 10:34:55 +00:00
***** ***
Tutorial
***** ***
Overflow
========
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
contract Overflow {
uint immutable x;
uint immutable y;
2022-02-16 02:59:49 +00:00
function add(uint x_, uint y_) internal pure returns (uint) {
return x_ + y_;
2021-03-09 10:34:55 +00:00
}
2022-02-16 02:59:49 +00:00
constructor(uint x_, uint y_) {
(x, y) = (x_, y_);
2021-03-09 10:34:55 +00:00
}
function stateAdd() public view returns (uint) {
return add(x, y);
}
}
The contract above shows an overflow check example.
2021-07-07 10:31:09 +00:00
The SMTChecker does not check underflow and overflow by default for Solidity >=0.8.7,
so we need to use the command line option `` --model-checker-targets "underflow,overflow" ``
or the JSON option `` settings.modelChecker.targets = ["underflow", "overflow"] `` .
See :ref: `this section for targets configuration<smtchecker_targets>` .
2021-03-09 10:34:55 +00:00
Here, it reports the following:
2021-07-14 17:05:42 +00:00
.. code-block :: text
2021-03-09 10:34:55 +00:00
Warning: CHC: Overflow (resulting value larger than 2**256 - 1) happens here.
Counterexample:
x = 1, y = 115792089237316195423570985008687907853269984665640564039457584007913129639935
= 0
Transaction trace:
Overflow.constructor(1, 115792089237316195423570985008687907853269984665640564039457584007913129639935)
State: x = 1, y = 115792089237316195423570985008687907853269984665640564039457584007913129639935
Overflow.stateAdd()
Overflow.add(1, 115792089237316195423570985008687907853269984665640564039457584007913129639935) -- internal call
--> o.sol:9:20:
|
2022-02-16 02:59:49 +00:00
9 | return x_ + y_;
2021-03-09 10:34:55 +00:00
| ^^^^^^^
If we add `` require `` statements that filter out overflow cases,
the SMTChecker proves that no overflow is reachable (by not reporting warnings):
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
contract Overflow {
uint immutable x;
uint immutable y;
2022-02-16 02:59:49 +00:00
function add(uint x_, uint y_) internal pure returns (uint) {
return x_ + y_;
2021-03-09 10:34:55 +00:00
}
2022-02-16 02:59:49 +00:00
constructor(uint x_, uint y_) {
(x, y) = (x_, y_);
2021-03-09 10:34:55 +00:00
}
function stateAdd() public view returns (uint) {
require(x < type(uint128).max);
require(y < type(uint128).max);
return add(x, y);
}
}
Assert
======
An assertion represents an invariant in your code: a property that must be true
*for all transactions, including all input and storage values* , otherwise there is a bug.
The code below defines a function `` f `` that guarantees no overflow.
Function `` inv `` defines the specification that `` f `` is monotonically increasing:
2022-02-16 02:59:49 +00:00
for every possible pair `` (a, b) `` , if `` b > a `` then `` f(b) > f(a) `` .
2021-03-09 10:34:55 +00:00
Since `` f `` is indeed monotonically increasing, the SMTChecker proves that our
property is correct. You are encouraged to play with the property and the function
definition to see what results come out!
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
contract Monotonic {
2022-02-16 02:59:49 +00:00
function f(uint x) internal pure returns (uint) {
require(x < type(uint128).max);
return x * 42;
2021-03-09 10:34:55 +00:00
}
2022-02-16 02:59:49 +00:00
function inv(uint a, uint b) public pure {
require(b > a);
assert(f(b) > f(a));
2021-03-09 10:34:55 +00:00
}
}
We can also add assertions inside loops to verify more complicated properties.
The following code searches for the maximum element of an unrestricted array of
numbers, and asserts the property that the found element must be greater or
equal every element in the array.
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
contract Max {
2022-02-16 02:59:49 +00:00
function max(uint[] memory a) public pure returns (uint) {
2021-03-09 10:34:55 +00:00
uint m = 0;
2022-02-16 02:59:49 +00:00
for (uint i = 0; i < a.length; ++i)
if (a[i] > m)
m = a[i];
2021-03-09 10:34:55 +00:00
2022-02-16 02:59:49 +00:00
for (uint i = 0; i < a.length; ++i)
assert(m >= a[i]);
2021-03-09 10:34:55 +00:00
return m;
}
}
Note that in this example the SMTChecker will automatically try to prove three properties:
1. `` ++i `` in the first loop does not overflow.
2. `` ++i `` in the second loop does not overflow.
3. The assertion is always true.
.. note ::
2021-07-14 16:56:55 +00:00
The properties involve loops, which makes it *much much* harder than the previous
examples, so beware of loops!
2021-03-09 10:34:55 +00:00
All the properties are correctly proven safe. Feel free to change the
properties and/or add restrictions on the array to see different results.
For example, changing the code to
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
contract Max {
2022-02-16 02:59:49 +00:00
function max(uint[] memory a) public pure returns (uint) {
require(a.length >= 5);
2021-03-09 10:34:55 +00:00
uint m = 0;
2022-02-16 02:59:49 +00:00
for (uint i = 0; i < a.length; ++i)
if (a[i] > m)
m = a[i];
2021-03-09 10:34:55 +00:00
2022-02-16 02:59:49 +00:00
for (uint i = 0; i < a.length; ++i)
assert(m > a[i]);
2021-03-09 10:34:55 +00:00
return m;
}
}
gives us:
2021-07-14 17:05:42 +00:00
.. code-block :: text
2021-03-09 10:34:55 +00:00
2021-07-14 16:56:55 +00:00
Warning: CHC: Assertion violation happens here.
Counterexample:
2021-03-09 10:34:55 +00:00
2022-02-16 02:59:49 +00:00
a = [0, 0, 0, 0, 0]
2021-07-14 16:56:55 +00:00
= 0
2021-03-09 10:34:55 +00:00
2021-07-14 16:56:55 +00:00
Transaction trace:
Test.constructor()
Test.max([0, 0, 0, 0, 0])
--> max.sol:14:4:
|
2022-02-16 02:59:49 +00:00
14 | assert(m > a[i]);
2021-03-09 10:34:55 +00:00
State Properties
================
So far the examples only demonstrated the use of the SMTChecker over pure code,
proving properties about specific operations or algorithms.
A common type of properties in smart contracts are properties that involve the
state of the contract. Multiple transactions might be needed to make an assertion
fail for such a property.
As an example, consider a 2D grid where both axis have coordinates in the range (-2^128, 2^128 - 1).
Let us place a robot at position (0, 0). The robot can only move diagonally, one step at a time,
and cannot move outside the grid. The robot's state machine can be represented by the smart contract
below.
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
contract Robot {
int x = 0;
int y = 0;
modifier wall {
require(x > type(int128).min && x < type(int128).max);
require(y > type(int128).min && y < type(int128).max);
_;
}
function moveLeftUp() wall public {
--x;
++y;
}
function moveLeftDown() wall public {
--x;
--y;
}
function moveRightUp() wall public {
++x;
++y;
}
function moveRightDown() wall public {
++x;
--y;
}
function inv() public view {
assert((x + y) % 2 == 0);
}
}
Function `` inv `` represents an invariant of the state machine that `` x + y ``
must be even.
The SMTChecker manages to prove that regardless how many commands we give the
robot, even if infinitely many, the invariant can *never* fail. The interested
reader may want to prove that fact manually as well. Hint: this invariant is
inductive.
We can also trick the SMTChecker into giving us a path to a certain position we
think might be reachable. We can add the property that (2, 4) is *not*
reachable, by adding the following function.
.. code-block :: Solidity
function reach_2_4() public view {
assert(!(x == 2 && y == 4));
}
This property is false, and while proving that the property is false,
the SMTChecker tells us exactly *how* to reach (2, 4):
2021-07-14 17:05:42 +00:00
.. code-block :: text
2021-03-09 10:34:55 +00:00
2021-07-14 16:56:55 +00:00
Warning: CHC: Assertion violation happens here.
Counterexample:
x = 2, y = 4
Transaction trace:
Robot.constructor()
State: x = 0, y = 0
Robot.moveLeftUp()
State: x = (- 1), y = 1
Robot.moveRightUp()
State: x = 0, y = 2
Robot.moveRightUp()
State: x = 1, y = 3
Robot.moveRightUp()
State: x = 2, y = 4
Robot.reach_2_4()
--> r.sol:35:4:
|
35 | assert(!(x == 2 && y == 4));
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^
2021-03-09 10:34:55 +00:00
Note that the path above is not necessarily deterministic, as there are
other paths that could reach (2, 4). The choice of which path is shown
might change depending on the used solver, its version, or just randomly.
External Calls and Reentrancy
=============================
Every external call is treated as a call to unknown code by the SMTChecker.
The reasoning behind that is that even if the code of the called contract is
available at compile time, there is no guarantee that the deployed contract
will indeed be the same as the contract where the interface came from at
compile time.
In some cases, it is possible to automatically infer properties over state
variables that are still true even if the externally called code can do
anything, including reenter the caller contract.
.. code-block :: Solidity
2021-03-31 15:20:30 +00:00
// SPDX-License-Identifier: GPL-3.0
2021-03-09 10:34:55 +00:00
pragma solidity >=0.8.0;
interface Unknown {
2021-07-14 16:56:55 +00:00
function run() external;
2021-03-09 10:34:55 +00:00
}
contract Mutex {
2021-07-14 16:56:55 +00:00
uint x;
bool lock;
Unknown immutable unknown;
2022-02-16 02:59:49 +00:00
constructor(Unknown u) {
require(address(u) != address(0));
unknown = u;
2021-07-14 16:56:55 +00:00
}
modifier mutex {
require(!lock);
lock = true;
_;
lock = false;
}
2022-02-16 02:59:49 +00:00
function set(uint x_) mutex public {
x = x_;
2021-07-14 16:56:55 +00:00
}
function run() mutex public {
uint xPre = x;
unknown.run();
assert(xPre == x);
}
2021-03-09 10:34:55 +00:00
}
The example above shows a contract that uses a mutex flag to forbid reentrancy.
The solver is able to infer that when `` unknown.run() `` is called, the contract
is already "locked", so it would not be possible to change the value of `` x `` ,
regardless of what the unknown called code does.
If we "forget" to use the `` mutex `` modifier on function `` set `` , the
2021-10-06 09:53:20 +00:00
SMTChecker is able to synthesize the behaviour of the externally called code so
2021-03-09 10:34:55 +00:00
that the assertion fails:
2021-07-14 17:05:42 +00:00
.. code-block :: text
2021-03-09 10:34:55 +00:00
2021-07-14 16:56:55 +00:00
Warning: CHC: Assertion violation happens here.
Counterexample:
x = 1, lock = true, unknown = 1
2021-03-09 10:34:55 +00:00
2021-07-14 16:56:55 +00:00
Transaction trace:
Mutex.constructor(1)
State: x = 0, lock = false, unknown = 1
Mutex.run()
unknown.run() -- untrusted external call, synthesized as:
Mutex.set(1) -- reentrant call
--> m.sol:32:3:
|
32 | assert(xPre == x);
| ^^^^^^^^^^^^^^^^^
2021-03-09 10:34:55 +00:00
.. _smtchecker_options:
***** ***** ***** ***** ***** *** *
SMTChecker Options and Tuning
***** ***** ***** ***** ***** *** *
Timeout
=======
The SMTChecker uses a hardcoded resource limit (`` rlimit `` ) chosen per solver,
which is not precisely related to time. We chose the `` rlimit `` option as the default
because it gives more determinism guarantees than time inside the solver.
This options translates roughly to "a few seconds timeout" per query. Of course many properties
are very complex and need a lot of time to be solved, where determinism does not matter.
If the SMTChecker does not manage to solve the contract properties with the default `` rlimit `` ,
a timeout can be given in milliseconds via the CLI option `` --model-checker-timeout <time> `` or
the JSON option `` settings.modelChecker.timeout=<time> `` , where 0 means no timeout.
2021-07-07 10:31:09 +00:00
.. _smtchecker_targets:
2021-03-09 10:34:55 +00:00
Verification Targets
====================
2021-04-14 10:56:43 +00:00
The types of verification targets created by the SMTChecker can also be
customized via the CLI option `` --model-checker-target <targets> `` or the JSON
option `` settings.modelChecker.targets=<targets> `` .
In the CLI case, `` <targets> `` is a no-space-comma-separated list of one or
more verification targets, and an array of one or more targets as strings in
the JSON input.
The keywords that represent the targets are:
2021-03-09 10:34:55 +00:00
- Assertions: `` assert `` .
- Arithmetic underflow: `` underflow `` .
- Arithmetic overflow: `` overflow `` .
- Division by zero: `` divByZero `` .
- Trivial conditions and unreachable code: `` constantCondition `` .
- Popping an empty array: `` popEmptyArray `` .
2021-03-23 18:15:14 +00:00
- Out of bounds array/fixed bytes index access: `` outOfBounds `` .
2021-03-09 10:34:55 +00:00
- Insufficient funds for a transfer: `` balance `` .
2021-04-14 10:56:43 +00:00
- All of the above: `` default `` (CLI only).
2021-03-09 10:34:55 +00:00
A common subset of targets might be, for example:
`` --model-checker-targets assert,overflow `` .
2021-07-07 10:31:09 +00:00
All targets are checked by default, except underflow and overflow for Solidity >=0.8.7.
2021-03-09 10:34:55 +00:00
There is no precise heuristic on how and when to split verification targets,
but it can be useful especially when dealing with large contracts.
2023-02-09 16:07:13 +00:00
Proved Targets
==============
If there are any proved targets, the SMTChecker issues one warning per engine stating
how many targets were proved. If the user wishes to see all the specific
proved targets, the CLI option `` --model-checker-show-proved `` and
the JSON option `` settings.modelChecker.showProved = true `` can be used.
2021-07-28 16:58:03 +00:00
Unproved Targets
================
If there are any unproved targets, the SMTChecker issues one warning stating
how many unproved targets there are. If the user wishes to see all the specific
2021-08-11 08:54:03 +00:00
unproved targets, the CLI option `` --model-checker-show-unproved `` and
2021-07-28 16:58:03 +00:00
the JSON option `` settings.modelChecker.showUnproved = true `` can be used.
2023-03-06 13:19:58 +00:00
Unsupported Language Features
=============================
Certain Solidity language features are not completely supported by the SMT
encoding that the SMTChecker applies, for example assembly blocks.
The unsupported construct is abstracted via overapproximation to preserve
soundness, meaning any properties reported safe are safe even though this
feature is unsupported.
However such abstraction may cause false positives when the target properties
depend on the precise behavior of the unsupported feature.
If the encoder encounters such cases it will by default report a generic warning
stating how many unsupported features it has seen.
If the user wishes to see all the specific unsupported features, the CLI option
`` --model-checker-show-unsupported `` and the JSON option
`` settings.modelChecker.showUnsupported = true `` can be used, where their default
value is `` false `` .
2021-04-15 10:31:45 +00:00
Verified Contracts
==================
By default all the deployable contracts in the given sources are analyzed separately as
the one that will be deployed. This means that if a contract has many direct
and indirect inheritance parents, all of them will be analyzed on their own,
even though only the most derived will be accessed directly on the blockchain.
This causes an unnecessary burden on the SMTChecker and the solver. To aid
cases like this, users can specify which contracts should be analyzed as the
deployed one. The parent contracts are of course still analyzed, but only in
the context of the most derived contract, reducing the complexity of the
encoding and generated queries. Note that abstract contracts are by default
not analyzed as the most derived by the SMTChecker.
The chosen contracts can be given via a comma-separated list (whitespace is not
allowed) of <source>:<contract> pairs in the CLI:
`` --model-checker-contracts "<source1.sol:contract1>,<source2.sol:contract2>,<source2.sol:contract3>" `` ,
and via the object `` settings.modelChecker.contracts `` in the :ref: `JSON input<compiler-api>` ,
which has the following form:
2021-07-14 16:53:15 +00:00
.. code-block :: json
2021-04-15 10:31:45 +00:00
2021-07-14 16:53:15 +00:00
"contracts": {
2021-07-14 16:56:55 +00:00
"source1.sol": ["contract1"],
"source2.sol": ["contract2", "contract3"]
}
2021-04-15 10:31:45 +00:00
2021-10-12 09:12:18 +00:00
Trusted External Calls
======================
By default, the SMTChecker does not assume that compile-time available code
is the same as the runtime code for external calls. Take the following contracts
as an example:
.. code-block :: solidity
// SPDX-License-Identifier: GPL-3.0
pragma solidity >=0.8.0;
contract Ext {
uint public x;
function setX(uint _x) public { x = _x; }
}
contract MyContract {
function callExt(Ext _e) public {
_e.setX(42);
assert(_e.x() == 42);
}
}
When `` MyContract.callExt `` is called, an address is given as the argument.
At deployment time, we cannot know for sure that address `` _e `` actually
contains a deployment of contract `` Ext `` .
Therefore, the SMTChecker will warn that the assertion above can be violated,
which is true, if `` _e `` contains another contract than `` Ext `` .
However, it can be useful to treat these external calls as trusted, for example,
to test that different implementations of an interface conform to the same property.
This means assuming that address `` _e `` indeed was deployed as contract `` Ext `` .
This mode can be enabled via the CLI option `` --model-checker-ext-calls=trusted ``
or the JSON field `` settings.modelChecker.extCalls: "trusted" `` .
Please be aware that enabling this mode can make the SMTChecker analysis much more
computationally costly.
An important part of this mode is that it is applied to contract types and high
level external calls to contracts, and not low level calls such as `` call `` and
`` delegatecall `` . The storage of an address is stored per contract type, and
the SMTChecker assumes that an externally called contract has the type of the
caller expression. Therefore, casting an `` address `` or a contract to
different contract types will yield different storage values and can give
unsound results if the assumptions are inconsistent, such as the example below:
.. code-block :: solidity
// SPDX-License-Identifier: GPL-3.0
pragma solidity >=0.8.0;
contract D {
constructor(uint _x) { x = _x; }
uint public x;
function setX(uint _x) public { x = _x; }
}
contract E {
constructor() { x = 2; }
uint public x;
function setX(uint _x) public { x = _x; }
}
contract C {
function f() public {
address d = address(new D(42));
// `d` was deployed as `D` , so its `x` should be 42 now.
assert(D(d).x() == 42); // should hold
assert(D(d).x() == 43); // should fail
// E and D have the same interface, so the following
// call would also work at runtime.
// However, the change to `E(d)` is not reflected in `D(d)` .
E(d).setX(1024);
// Reading from `D(d)` now will show old values.
// The assertion below should fail at runtime,
// but succeeds in this mode's analysis (unsound).
assert(D(d).x() == 42);
// The assertion below should succeed at runtime,
// but fails in this mode's analysis (false positive).
assert(D(d).x() == 1024);
}
}
Due to the above, make sure that the trusted external calls to a certain
variable of `` address `` or `` contract `` type always have the same caller
expression type.
It is also helpful to cast the called contract's variable as the type of the
most derived type in case of inheritance.
.. code-block :: solidity
// SPDX-License-Identifier: GPL-3.0
pragma solidity >=0.8.0;
interface Token {
function balanceOf(address _a) external view returns (uint);
function transfer(address _to, uint _amt) external;
}
contract TokenCorrect is Token {
mapping (address => uint) balance;
constructor(address _a, uint _b) {
balance[_a] = _b;
}
function balanceOf(address _a) public view override returns (uint) {
return balance[_a];
}
function transfer(address _to, uint _amt) public override {
require(balance[msg.sender] >= _amt);
balance[msg.sender] -= _amt;
balance[_to] += _amt;
}
}
contract Test {
function property_transfer(address _token, address _to, uint _amt) public {
require(_to != address(this));
TokenCorrect t = TokenCorrect(_token);
uint xPre = t.balanceOf(address(this));
require(xPre >= _amt);
uint yPre = t.balanceOf(_to);
t.transfer(_to, _amt);
uint xPost = t.balanceOf(address(this));
uint yPost = t.balanceOf(_to);
assert(xPost == xPre - _amt);
assert(yPost == yPre + _amt);
}
}
Note that in function `` property_transfer `` , the external calls are
performed on variable `` t ``
Another caveat of this mode are calls to state variables of contract type
outside the analyzed contract. In the code below, even though `` B `` deploys
`` A `` , it is also possible for the address stored in `` B.a `` to be called by
anyone outside of `` B `` in between transactions to `` B `` itself. To reflect the
possible changes to `` B.a `` , the encoding allows an unbounded number of calls
to be made to `` B.a `` externally. The encoding will keep track of `` B.a `` 's
storage, therefore assertion (2) should hold. However, currently the encoding
allows such calls to be made from `` B `` conceptually, therefore assertion (3)
fails. Making the encoding stronger logically is an extension of the trusted
mode and is under development. Note that the encoding does not keep track of
storage for `` address `` variables, therefore if `` B.a `` had type `` address ``
the encoding would assume that its storage does not change in between
transactions to `` B `` .
.. code-block :: solidity
pragma solidity >=0.8.0;
contract A {
uint public x;
address immutable public owner;
constructor() {
owner = msg.sender;
}
function setX(uint _x) public {
require(msg.sender == owner);
x = _x;
}
}
contract B {
A a;
constructor() {
a = new A();
assert(a.x() == 0); // (1) should hold
}
function g() public view {
assert(a.owner() == address(this)); // (2) should hold
assert(a.x() == 0); // (3) should hold, but fails due to a false positive
}
}
2021-10-06 09:53:20 +00:00
Reported Inferred Inductive Invariants
======================================
For properties that were proved safe with the CHC engine,
the SMTChecker can retrieve inductive invariants that were inferred by the Horn
solver as part of the proof.
2023-02-23 12:09:12 +00:00
Currently only two types of invariants can be reported to the user:
2021-10-06 09:53:20 +00:00
- Contract Invariants: these are properties over the contract's state variables
that are true before and after every possible transaction that the contract may ever run. For example, `` x >= y `` , where `` x `` and `` y `` are a contract's state variables.
- Reentrancy Properties: they represent the behavior of the contract
in the presence of external calls to unknown code. These properties can express a relation
between the value of the state variables before and after the external call, where the external call is free to do anything, including making reentrant calls to the analyzed contract. Primed variables represent the state variables' values after said external call. Example: `` lock -> x = x' `` .
The user can choose the type of invariants to be reported using the CLI option `` --model-checker-invariants "contract,reentrancy" `` or as an array in the field `` settings.modelChecker.invariants `` in the :ref: `JSON input<compiler-api>` .
By default the SMTChecker does not report invariants.
2021-07-15 15:39:01 +00:00
Division and Modulo With Slack Variables
========================================
Spacer, the default Horn solver used by the SMTChecker, often dislikes division
and modulo operations inside Horn rules. Because of that, by default the
Solidity division and modulo operations are encoded using the constraint
`` a = b * d + m `` where `` d = a / b `` and `` m = a % b `` .
However, other solvers, such as Eldarica, prefer the syntactically precise operations.
The command line flag `` --model-checker-div-mod-no-slacks `` and the JSON option
`` settings.modelChecker.divModNoSlacks `` can be used to toggle the encoding
depending on the used solver preferences.
2021-03-31 15:20:30 +00:00
2021-04-30 14:53:48 +00:00
Natspec Function Abstraction
============================
Certain functions including common math methods such as `` pow ``
and `` sqrt `` may be too complex to be analyzed in a fully automated way.
These functions can be annotated with Natspec tags that indicate to the
SMTChecker that these functions should be abstracted. This means that the
body of the function is not used, and when called, the function will:
- Return a nondeterministic value, and either keep the state variables unchanged if the abstracted function is view/pure, or also set the state variables to nondeterministic values otherwise. This can be used via the annotation `` /// @custom:smtchecker abstract-function-nondet `` .
- Act as an uninterpreted function. This means that the semantics of the function (given by the body) are ignored, and the only property this function has is that given the same input it guarantees the same output. This is currently under development and will be available via the annotation `` /// @custom:smtchecker abstract-function-uf `` .
2021-07-15 15:39:01 +00:00
.. _smtchecker_engines:
2021-03-09 10:34:55 +00:00
Model Checking Engines
======================
The SMTChecker module implements two different reasoning engines, a Bounded
Model Checker (BMC) and a system of Constrained Horn Clauses (CHC). Both
engines are currently under development, and have different characteristics.
The engines are independent and every property warning states from which engine
it came. Note that all the examples above with counterexamples were
reported by CHC, the more powerful engine.
By default both engines are used, where CHC runs first, and every property that
was not proven is passed over to BMC. You can choose a specific engine via the CLI
option `` --model-checker-engine {all,bmc,chc,none} `` or the JSON option
`` settings.modelChecker.engine={all,bmc,chc,none} `` .
Bounded Model Checker (BMC)
---------------------------
The BMC engine analyzes functions in isolation, that is, it does not take the
overall behavior of the contract over multiple transactions into account when
analyzing each function. Loops are also ignored in this engine at the moment.
Internal function calls are inlined as long as they are not recursive, directly
or indirectly. External function calls are inlined if possible. Knowledge
that is potentially affected by reentrancy is erased.
The characteristics above make BMC prone to reporting false positives,
but it is also lightweight and should be able to quickly find small local bugs.
Constrained Horn Clauses (CHC)
------------------------------
A contract's Control Flow Graph (CFG) is modelled as a system of
Horn clauses, where the life cycle of the contract is represented by a loop
that can visit every public/external function non-deterministically. This way,
the behavior of the entire contract over an unbounded number of transactions
is taken into account when analyzing any function. Loops are fully supported
by this engine. Internal function calls are supported, and external function
calls assume the called code is unknown and can do anything.
The CHC engine is much more powerful than BMC in terms of what it can prove,
and might require more computing resources.
2021-05-19 15:35:19 +00:00
SMT and Horn solvers
====================
The two engines detailed above use automated theorem provers as their logical
backends. BMC uses an SMT solver, whereas CHC uses a Horn solver. Often the
same tool can act as both, as seen in `z3 <https://github.com/Z3Prover/z3> `_ ,
which is primarily an SMT solver and makes `Spacer
<https://spacer.bitbucket.io/> `_ available as a Horn solver, and ` Eldarica
<https://github.com/uuverifiers/eldarica> `_ which does both.
The user can choose which solvers should be used, if available, via the CLI
2022-05-15 15:45:43 +00:00
option `` --model-checker-solvers {all,cvc4,eld,smtlib2,z3} `` or the JSON option
2021-05-19 15:35:19 +00:00
`` settings.modelChecker.solvers=[smtlib2,z3] `` , where:
- `` cvc4 `` is only available if the `` solc `` binary is compiled with it. Only BMC uses `` cvc4 `` .
2022-05-15 15:45:43 +00:00
- `` eld `` is used via its binary which must be installed in the system. Only CHC uses `` eld `` , and only if `` z3 `` is not enabled.
2021-05-19 15:35:19 +00:00
- `` smtlib2 `` outputs SMT/Horn queries in the `smtlib2 <http://smtlib.cs.uiowa.edu/> `_ format.
These can be used together with the compiler's `callback mechanism <https://github.com/ethereum/solc-js> `_ so that
any solver binary from the system can be employed to synchronously return the results of the queries to the compiler.
This can be used by both BMC and CHC depending on which solvers are called.
- `` z3 `` is available
- if `` solc `` is compiled with it;
2022-05-15 15:45:43 +00:00
- if a dynamic `` z3 `` library of version >=4.8.x is installed in a Linux system (from Solidity 0.7.6);
2021-05-19 15:35:19 +00:00
- statically in `` soljson.js `` (from Solidity 0.6.9), that is, the Javascript binary of the compiler.
2022-05-11 15:16:10 +00:00
.. note ::
z3 version 4.8.16 broke ABI compatibility with previous versions and cannot
be used with solc <=0.8.13. If you are using z3 >=4.8.16 please use solc
2022-12-09 09:44:38 +00:00
>=0.8.14, and conversely, only use older z3 with older solc releases.
We also recommend using the latest z3 release which is what SMTChecker also does.
2022-05-11 15:16:10 +00:00
2021-05-19 15:35:19 +00:00
Since both BMC and CHC use `` z3 `` , and `` z3 `` is available in a greater variety
of environments, including in the browser, most users will almost never need to be
concerned about this option. More advanced users might apply this option to try
alternative solvers on more complex problems.
Please note that certain combinations of chosen engine and solver will lead to
the SMTChecker doing nothing, for example choosing CHC and `` cvc4 `` .
2021-03-09 10:34:55 +00:00
***** ***** ***** ***** ***** ***** *
Abstraction and False Positives
***** ***** ***** ***** ***** ***** *
The SMTChecker implements abstractions in an incomplete and sound way: If a bug
is reported, it might be a false positive introduced by abstractions (due to
erasing knowledge or using a non-precise type). If it determines that a
verification target is safe, it is indeed safe, that is, there are no false
negatives (unless there is a bug in the SMTChecker).
If a target cannot be proven you can try to help the solver by using the tuning
options in the previous section.
If you are sure of a false positive, adding `` require `` statements in the code
with more information may also give some more power to the solver.
SMT Encoding and Types
======================
The SMTChecker encoding tries to be as precise as possible, mapping Solidity types
and expressions to their closest `SMT-LIB <http://smtlib.cs.uiowa.edu/> `_
representation, as shown in the table below.
+-----------------------+--------------------------------+-----------------------------+
|Solidity type |SMT sort |Theories |
+=======================+================================+=============================+
|Boolean |Bool |Bool |
+-----------------------+--------------------------------+-----------------------------+
|intN, uintN, address, |Integer |LIA, NIA |
|bytesN, enum, contract | | |
+-----------------------+--------------------------------+-----------------------------+
|array, mapping, bytes, |Tuple |Datatypes, Arrays, LIA |
|string |(Array elements, Integer length)| |
+-----------------------+--------------------------------+-----------------------------+
|struct |Tuple |Datatypes |
+-----------------------+--------------------------------+-----------------------------+
|other types |Integer |LIA |
+-----------------------+--------------------------------+-----------------------------+
Types that are not yet supported are abstracted by a single 256-bit unsigned
integer, where their unsupported operations are ignored.
For more details on how the SMT encoding works internally, see the paper
2022-09-08 13:22:47 +00:00
`SMT-based Verification of Solidity Smart Contracts <https://github.com/chriseth/solidity_isola/blob/master/main.pdf> `_ .
2021-03-09 10:34:55 +00:00
Function Calls
==============
In the BMC engine, function calls to the same contract (or base contracts) are
inlined when possible, that is, when their implementation is available. Calls
to functions in other contracts are not inlined even if their code is
available, since we cannot guarantee that the actual deployed code is the same.
The CHC engine creates nonlinear Horn clauses that use summaries of the called
functions to support internal function calls. External function calls are treated
as calls to unknown code, including potential reentrant calls.
Complex pure functions are abstracted by an uninterpreted function (UF) over
the arguments.
+-----------------------------------+--------------------------------------+
|Functions |BMC/CHC behavior |
+===================================+======================================+
|`` assert `` |Verification target. |
+-----------------------------------+--------------------------------------+
|`` require `` |Assumption. |
+-----------------------------------+--------------------------------------+
|internal call |BMC: Inline function call. |
| |CHC: Function summaries. |
+-----------------------------------+--------------------------------------+
|external call to known code |BMC: Inline function call or |
| |erase knowledge about state variables |
| |and local storage references. |
| |CHC: Assume called code is unknown. |
| |Try to infer invariants that hold |
| |after the call returns. |
+-----------------------------------+--------------------------------------+
|Storage array push/pop |Supported precisely. |
| |Checks whether it is popping an |
| |empty array. |
+-----------------------------------+--------------------------------------+
|ABI functions |Abstracted with UF. |
+-----------------------------------+--------------------------------------+
|`` addmod `` , `` mulmod `` |Supported precisely. |
+-----------------------------------+--------------------------------------+
|`` gasleft `` , `` blockhash `` , |Abstracted with UF. |
|`` keccak256 `` , `` ecrecover `` | |
|`` ripemd160 `` | |
+-----------------------------------+--------------------------------------+
|pure functions without |Abstracted with UF |
|implementation (external or | |
|complex) | |
+-----------------------------------+--------------------------------------+
|external functions without |BMC: Erase state knowledge and assume |
|implementation |result is nondeterminisc. |
| |CHC: Nondeterministic summary. |
| |Try to infer invariants that hold |
| |after the call returns. |
+-----------------------------------+--------------------------------------+
|transfer |BMC: Checks whether the contract's |
| |balance is sufficient. |
| |CHC: does not yet perform the check. |
+-----------------------------------+--------------------------------------+
|others |Currently unsupported |
+-----------------------------------+--------------------------------------+
Using abstraction means loss of precise knowledge, but in many cases it does
not mean loss of proving power.
2021-06-25 10:25:29 +00:00
.. code-block :: solidity
2021-03-09 10:34:55 +00:00
// SPDX-License-Identifier: GPL-3.0
pragma solidity >=0.8.0;
contract Recover
{
function f(
bytes32 hash,
2022-02-16 02:59:49 +00:00
uint8 v1, uint8 v2,
bytes32 r1, bytes32 r2,
bytes32 s1, bytes32 s2
2021-03-09 10:34:55 +00:00
) public pure returns (address) {
2022-02-16 02:59:49 +00:00
address a1 = ecrecover(hash, v1, r1, s1);
require(v1 == v2);
require(r1 == r2);
require(s1 == s2);
address a2 = ecrecover(hash, v2, r2, s2);
2021-03-09 10:34:55 +00:00
assert(a1 == a2);
return a1;
}
}
In the example above, the SMTChecker is not expressive enough to actually
compute `` ecrecover `` , but by modelling the function calls as uninterpreted
functions we know that the return value is the same when called on equivalent
parameters. This is enough to prove that the assertion above is always true.
Abstracting a function call with an UF can be done for functions known to be
deterministic, and can be easily done for pure functions. It is however
difficult to do this with general external functions, since they might depend
on state variables.
Reference Types and Aliasing
============================
Solidity implements aliasing for reference types with the same :ref:`data
location<data-location>`.
That means one variable may be modified through a reference to the same data
area.
The SMTChecker does not keep track of which references refer to the same data.
This implies that whenever a local reference or state variable of reference
type is assigned, all knowledge regarding variables of the same type and data
location is erased.
If the type is nested, the knowledge removal also includes all the prefix base
types.
2021-06-25 10:25:29 +00:00
.. code-block :: solidity
2021-03-09 10:34:55 +00:00
// SPDX-License-Identifier: GPL-3.0
pragma solidity >=0.8.0;
contract Aliasing
{
uint[] array1;
uint[][] array2;
function f(
uint[] memory a,
uint[] memory b,
uint[][] memory c,
uint[] storage d
) internal {
array1[0] = 42;
a[0] = 2;
c[0][0] = 2;
b[0] = 1;
// Erasing knowledge about memory references should not
// erase knowledge about state variables.
assert(array1[0] == 42);
// However, an assignment to a storage reference will erase
// storage knowledge accordingly.
d[0] = 2;
// Fails as false positive because of the assignment above.
assert(array1[0] == 42);
// Fails because `a == b` is possible.
assert(a[0] == 2);
// Fails because `c[i] == b` is possible.
assert(c[0][0] == 2);
assert(d[0] == 2);
assert(b[0] == 1);
}
function g(
uint[] memory a,
uint[] memory b,
uint[][] memory c,
uint x
) public {
f(a, b, c, array2[x]);
}
}
After the assignment to `` b[0] `` , we need to clear knowledge about `` a `` since
it has the same type (`` uint[] `` ) and data location (memory). We also need to
clear knowledge about `` c `` , since its base type is also a `` uint[] `` located
in memory. This implies that some `` c[i] `` could refer to the same data as
`` b `` or `` a `` .
Notice that we do not clear knowledge about `` array `` and `` d `` because they
are located in storage, even though they also have type `` uint[] `` . However,
if `` d `` was assigned, we would need to clear knowledge about `` array `` and
vice-versa.
2021-08-20 16:42:06 +00:00
Contract Balance
================
A contract may be deployed with funds sent to it, if `` msg.value `` > 0 in the
deployment transaction.
However, the contract's address may already have funds before deployment,
which are kept by the contract.
Therefore, the SMTChecker assumes that `` address(this).balance >= msg.value ``
in the constructor in order to be consistent with the EVM rules.
The contract's balance may also increase without triggering any calls to the
contract, if
- `` selfdestruct `` is executed by another contract with the analyzed contract
as the target of the remaining funds,
- the contract is the coinbase (i.e., `` block.coinbase `` ) of some block.
To model this properly, the SMTChecker assumes that at every new transaction
the contract's balance may grow by at least `` msg.value `` .
2021-03-09 10:34:55 +00:00
***** ***** ***** ***** **
Real World Assumptions
***** ***** ***** ***** **
Some scenarios can be expressed in Solidity and the EVM, but are expected to
never occur in practice.
One of such cases is the length of a dynamic storage array overflowing during a
push: If the `` push `` operation is applied to an array of length 2^256 - 1, its
length silently overflows.
However, this is unlikely to happen in practice, since the operations required
to grow the array to that point would take billions of years to execute.
2021-08-20 16:42:06 +00:00
Another similar assumption taken by the SMTChecker is that an address' balance
can never overflow.
A similar idea was presented in `EIP-1985 <https://eips.ethereum.org/EIPS/eip-1985> `_ .