Performance Benchmarks

Performance benchmark tests are located under forge/test/benchmark/ and are marked with @pytest.mark.perf. These tests measure end-to-end inference throughput on a Tenstorrent device by compiling the model with Forge, running timed device iterations, and verifying numerical accuracy against a CPU reference.
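The warmup-then-timed-loop pattern described above can be sketched as follows. This is a hypothetical illustration, not the actual implementation (which lives in benchmarks/vision_benchmark.py); the function name and signature are assumptions.

```python
import time


def timed_throughput(run_fn, batch_size, loop_count, warmup_count):
    """Return measured samples/second for run_fn, excluding warmup."""
    for _ in range(warmup_count):
        run_fn()  # warmup iterations: fill caches, trigger lazy compilation
    start = time.perf_counter()
    for _ in range(loop_count):
        run_fn()  # timed device iterations
    elapsed = time.perf_counter() - start
    return (loop_count * batch_size) / elapsed
```

Only the loop_count iterations inside the perf_counter window contribute to the reported throughput; warmup iterations are deliberately excluded.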

Directory layout

forge/test/benchmark/
├── conftest.py               # CLI options + forge_benchmark_options fixture
├── options.py                # ForgeBenchmarkOptions dataclass
├── utils.py                  # Console reporting + JSON result helpers
├── test_vision.py            # pytest entry points (ResNet-50, etc.)
└── benchmarks/
    └── vision_benchmark.py   # compile / warmup / timed loop / PCC logic
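The PCC (Pearson correlation coefficient) accuracy check mentioned in the layout above compares device output against a CPU reference. A minimal pure-Python sketch of the metric (the real check in benchmarks/vision_benchmark.py operates on tensors, and its pass threshold is not assumed here):

```python
import math


def pcc(device_out, cpu_ref):
    """Pearson correlation coefficient between two flat sequences of floats."""
    n = len(device_out)
    mean_a = sum(device_out) / n
    mean_b = sum(cpu_ref) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(device_out, cpu_ref))
    var_a = sum((a - mean_a) ** 2 for a in device_out)
    var_b = sum((b - mean_b) ** 2 for b in cpu_ref)
    return cov / math.sqrt(var_a * var_b)
```

Nearly identical outputs produce a PCC close to 1.0, so a benchmark passes its accuracy check when the PCC exceeds some threshold.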

Running benchmarks locally

To run all benchmark tests:

pytest -m perf forge/test/benchmark

To run a specific model:

pytest -svv -m perf forge/test/benchmark/test_vision.py::test_resnet50

To save results to a JSON file:

pytest -m perf forge/test/benchmark --output-file results.json
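The resulting file can then be read back with the standard json module; a minimal sketch (the exact schema is produced by utils.py and is not assumed here):

```python
import json


def load_results(path):
    """Load a benchmark results file written via --output-file."""
    with open(path) as f:
        return json.load(f)
```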

CLI options

These flags are registered in conftest.py and apply to any test under forge/test/benchmark. All overrides are optional; each test defines its own defaults.

Option                Default               Description
--------------------  --------------------  -----------
--output-file PATH    None                  Path to write benchmark results as JSON. If omitted, no file is written.
--batch-size N        per-test default      Number of samples per inference call (positive integer).
--loop-count N        per-test default      Number of timed iterations after warmup (positive integer).
--warmup-count N      min(32, loop_count)   Number of warmup iterations before timing begins (positive integer).
--data-format         bfloat16              Data format for model inputs and compiler config: float32 or bfloat16.
--training            False                 Run in training mode. Not supported by current benchmarks; raises an error if set.
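These flags could be wired together roughly as shown below. This is a hypothetical sketch of how conftest.py and options.py might cooperate; the file and fixture names match the layout above, but the bodies and field names are assumptions.

```python
from dataclasses import dataclass
from typing import Optional

import pytest


@dataclass
class ForgeBenchmarkOptions:
    """Assumed shape of the options dataclass from options.py."""
    output_file: Optional[str] = None
    batch_size: Optional[int] = None   # None means "use the per-test default"
    loop_count: Optional[int] = None
    warmup_count: Optional[int] = None
    data_format: str = "bfloat16"
    training: bool = False


def pytest_addoption(parser):
    # Registers the CLI flags listed in the table above.
    parser.addoption("--output-file", default=None)
    parser.addoption("--batch-size", type=int, default=None)
    parser.addoption("--loop-count", type=int, default=None)
    parser.addoption("--warmup-count", type=int, default=None)
    parser.addoption("--data-format", default="bfloat16",
                     choices=["float32", "bfloat16"])
    parser.addoption("--training", action="store_true")


@pytest.fixture
def forge_benchmark_options(request):
    # Bundles the parsed CLI values into one object for the tests.
    get = request.config.getoption
    return ForgeBenchmarkOptions(
        output_file=get("--output-file"),
        batch_size=get("--batch-size"),
        loop_count=get("--loop-count"),
        warmup_count=get("--warmup-count"),
        data_format=get("--data-format"),
        training=get("--training"),
    )
```

Because every field defaults to "unset", each test can fall back to its own defaults when a flag is omitted, as the table above describes.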

Example:

pytest -svv -m perf forge/test/benchmark/test_vision.py::test_resnet50 \
    --batch-size 8 \
    --loop-count 128 \
    --warmup-count 32 \
    --data-format bfloat16 \
    --output-file resnet50_results.json

Adding a new benchmark model

Add a parametrized test in test_vision.py for the same model family, or create a new test_<family>.py for a different category. Each test must be marked with @pytest.mark.perf and accept the forge_benchmark_options and forge_tmp_path fixtures.

Example:

import pytest

import forge
from third_party.tt_forge_models.<model>.pytorch.loader import ModelLoader, ModelVariant
from forge.forge_property_utils import Framework, Source, Task, ModelArch, record_model_properties

# DEFAULT_BATCH_SIZE, export_torch_model_to_onnx, and test_vision are assumed
# to be imported from the existing benchmark modules under forge/test/benchmark/.

variants = [ModelVariant.MY_MODEL]

@pytest.mark.perf
@pytest.mark.parametrize("variant", variants)
def test_my_model(variant, forge_benchmark_options, forge_tmp_path):
    model_name = record_model_properties(
        framework=Framework.ONNX,
        model=ModelArch.MY_ARCH,
        variant=variant.value,
        source=Source.HUGGINGFACE,
        task=Task.CV_IMAGE_CLASSIFICATION,
    )

    batch_size = forge_benchmark_options.batch_size or DEFAULT_BATCH_SIZE
    loader = ModelLoader(variant=variant)
    pytorch_model = loader.load_model().eval()
    inputs = loader.load_inputs(batch_size=batch_size)
    onnx_model = export_torch_model_to_onnx(
        pytorch_model, str(forge_tmp_path), inputs, model_name, opset_version=17
    )

    def load_inputs_fn(batch_size, dtype_override=None):
        return loader.load_inputs(batch_size=batch_size, dtype_override=dtype_override)

    test_vision(
        model=forge.OnnxModule(model_name, onnx_model),
        model_name=model_name,
        forge_benchmark_options=forge_benchmark_options,
        load_inputs_fn=load_inputs_fn,
        extract_output_tensor_fn=lambda o: o,
        batch_size=batch_size,
    )