VT Code uses a local-first performance workflow. Performance checks are measured manually and are not hard CI gates. The default stance is simple: do not guess, measure first, and only keep complexity that pays for itself.
- Keep release artifacts portable.
- Improve runtime without hurting day-to-day iteration speed.
- Optimize only measured hotspots.
- Do not guess where time goes. Capture a baseline before changing code that claims a performance win.
- Measure before tuning. Keep before/after numbers from
baseline.sh, targeted timers, or benchmarks. - Prefer simple algorithms when input sizes are small or not yet proven large.
- Avoid fancy algorithms and broad refactors unless measurements justify their constant-factor and maintenance cost.
- Start with data structures and layout. In VT Code, the right cache shape, queue boundary, or representation usually matters more than clever control flow.
These rules apply to product code and refactors alike. The burden of proof is on the optimization, not on the simpler baseline.
# 1) Capture baseline
./scripts/perf/baseline.sh baseline
# 2) Make a targeted change
# 3) Capture latest
./scripts/perf/baseline.sh latest
# 4) Compare results
./scripts/perf/compare.shArtifacts are written to .vtcode/perf/ and include JSON metrics plus raw logs.
The perf harness clears RUSTC_WRAPPER and CARGO_BUILD_RUSTC_WRAPPER by default for its cargo steps so local measurements still work when sccache is configured but unavailable. Set PERF_KEEP_RUSTC_WRAPPER=1 only when you explicitly want to keep the wrapper. startup_ms measures the built target/debug/vtcode binary rather than cargo run, which keeps compile time out of the startup number.
Use this loop for any non-trivial performance change. Change one thing at a time so the comparison stays attributable.
Use this when collecting profiler traces:
./scripts/perf/profile.shThis builds release with:
-C force-frame-pointers=yesCARGO_PROFILE_RELEASE_DEBUG=line-tables-only
Then profile target/release/vtcode with your preferred tool.
For local experiments only:
./scripts/perf/native-build.sh
./scripts/perf/native-run.sh -- --versionThese scripts append -C target-cpu=native for local runs only. They do not change portable release defaults.
Current Criterion benches:
cargo bench -p vtcode-core --bench tool_pipeline
cargo bench -p vtcode-tools --bench cache_benchUse benches when a hotspot is stable and repeatable. Use the baseline/profile scripts when the question is broader end-to-end behavior.
- Change one thing at a time.
- Keep changes surgical and behavior-preserving.
- Prefer simple, safe single-pass reductions over broad refactors.
- Revisit data structures before introducing algorithmic sophistication.
- Keep the simplest implementation until measured workload data proves it insufficient.
- For hashers, follow the selective policy in
performance-hasher-policy.md.