A Guide to Porting Original Ethereum Tests to EEST¶

Background¶

EEST is the successor to ethereum/tests (aka "original tests"), a repository that defined EVM test cases from the Frontier phase up to and including The Merge. These test cases are specified as YAML (and occasionally JSON) files in the ./src/ sub-directory. JSON test fixtures, which are fully-populated tests that can be executed against clients, are generated using ethereum/retesteth. These JSON artifacts are regenerated when needed and added to the repository, typically in the tests/static/state_tests sub-directory.

From Shanghai onward, new test cases — especially for new features introduced in hard forks—are defined in Python within EEST. While the existing test cases remain important for client testing, porting ethereum/tests to EEST will help maintain and generate tests for newer forks. This also ensures feature parity, as client teams will only need to obtain test fixture releases from one source.

While automating the conversion of the remaining YAML (or JSON) test cases to Python is possible, manually porting individual test cases offers several benefits:

Reducing the number of test cases by combining multiple YAML (or JSON) cases into a single Python test function using parametrization.
Potentially improving coverage by parametrizing the Python version.
Producing higher quality code and documentation, which are typically clearer than an automated conversion.
Ensuring better organization of tests within the ./tests folder of execution-spec-tests by fork and EIP.

Porting an original test¶

Select one or more test cases from ./tests/static/state_tests/ to port and create an issue in this repository AND comment on this tracker issue.
Add a new test in the appropriate fork folder, following the guidelines for choosing a test type.
Submit a PR with the ported tests:
1. Add the list of ported files using python marker to the head of your python test.
  
  Example:
```
 @pytest.mark.ported_from(
[
    "https://github.com/ethereum/tests/blob/v13.3/src/GeneralStateTestsFiller/stCreateTest/CREATE_ContractSuicideDuringInit_ThenStoreThenReturnFiller.json",
    "https://github.com/ethereum/tests/blob/v13.3/src/GeneralStateTestsFiller/stCreateTest/CREATE_ContractSuicideDuringInit_WithValueFiller.json",
],
pr=["https://github.com/ethereum/execution-spec-tests/pull/1871"],
# coverage_missed_reason="Converting solidity code result in following opcode not being used:",
```
  Replace test names with your chosen tests and PR number.
  
  Uncomment coverage_missed_reason when all the missed coverage lines are approved, usually some opcodes end up not used after translating test logic from lllc, yul.
  
  But sometimes missed coverage line could hint that you forgot to account important test logic.
  
  If no coverage is missed, you are good!
2. Remove the ported files from .tests/static/state_tests in your PR

See also: 📄 Getting started with EEST.

Filling tests¶

EEST uses pytest to run tests against EELS (an EVM implementation for testing). This process is known as "filling" and verifies the assertions in your tests. You can use the fill CLI for this. For example, see how to fill the PUSH opcode.

uv run fill tests/frontier/opcodes/test_push.py

If the tests can't currently be filled, please explain the issue (feel free to also open a Discussion).

Debugging tests¶

By default, EVM logs are stored in the logs folder at the repository root. You can check the output folder to review transaction results. If needed, review a previous PR that ported tests (e.g., the PR porting the PUSH opcode, and other port PRs).

Test coverage¶

It's crucial that ported tests maintain coverage parity with original tests. This ensures that no critical functions are left untested and prevents the introduction of bugs. A CI workflow automatically checks for coverage.

If coverage action fails (See: 📄 An example of a failing test coverage), it's recommended to run the coverage action locally (see: 📄 How to run GitHub actions locally), which should generate a evmtest_coverage directory:

❯ tree evmtest_coverage  -L 2
evmtest_coverage
└── coverage
    ├── BASE
    ├── BASE_TESTS
    ├── coverage_BASE.lcov
    ├── coverage_PATCH.lcov
    ├── DIFF
    ├── difflog.txt
    ├── PATCH
    └── PATCH_TESTS

Here BASEis original tests, PATCH is the ported test, and DIFF is the coverage difference on EVMONE. Open evmtest_coverage/coverage/DIFF/index.html in browser:

Label	Description
`LBC`	Lost base coverage: Code that was tested before, but is untested now.
`UBC`	Uncovered baseline code: Code that was untested before and untested now.
`GBC`	Gained baseline coverage: Code that was untested before, but is tested now.
`CBC`	Covered baseline code: Code that was tested before and is tested now.

Follow the hyperlinks for lost base coverage (LBC) to address coverage gaps. Here is an example coverage loss:

Lost line coverage from a coverage report. In this case, caused by a missing invocation of CALLDATALOAD.

Expected coverage loss

EEST uses pytest, a popular Python testing framework, to help orchestrate testing Ethereum specifications, while original tests relied on large, static contracts and the EVM to handle much of the execution. This difference can lead to coverage gaps. EEST favors dynamic contract creation for each test vector, while original tests preferred a single static contract with multiple test vectors determined by transaction input data.

It's important to note that coverage helps identify missing test paths. If you believe the coverage loss is due to differences in "setup" code between frameworks and doesn't impact the feature you're testing, explain this in your PR. A team member can help with the review.

For example, review the [discussion in this PR] to see an example of why and how coverage loss can occur.(https://github.com/ethereum/execution-spec-tests/pull/975#issuecomment-2528792289)

Resolving Coverage Gaps from Yul Compilation¶

When porting tests from ethereum/tests, you may encounter coverage gaps that are false positives. This commonly occurs because:

Original tests often used Yul to define smart contracts, and solc compilation introduces additional opcodes that aren't specifically under test
EEST ports use the explicit EEST Opcode mini-language, which is more precise in opcode definition

If coverage analysis shows missing opcodes that were only present due to Yul compilation artifacts (not the actual feature being tested), this can be resolved during PR review by adding the coverage_missed_reason parameter:

@pytest.mark.ported_from(
    ["path/to/original_test.json"],
    coverage_missed_reason="Missing opcodes are Yul compilation artifacts, not part of tested feature"
)

Add coverage_missed_reason only after PR review

Only add coverage_missed_reason after PR review determines the coverage gap is acceptable, never preemptively. This helps maintain test coverage integrity while accounting for legitimate differences between Yul-based and EEST opcode-based implementations.