[cuda.compute]: add benchmarks to measure host side overhead#9432
[cuda.compute]: add benchmarks to measure host side overhead#9432NaderAlAwar wants to merge 5 commits into
Conversation
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
SummaryThis PR adds host-side pytest-benchmark benchmarks for cuda.compute to measure host-side overheads and first-time wrapper build/JIT costs, addressing issues Key Changes
Other notes for reviewers / TODOs
suggestion: WalkthroughAdds a host-side benchmark framework for cuda.compute: types and noop-patching, per-primitive harnesses, a CASES catalog, pytest-benchmark tests for build/oneshot/twoshot host overhead, and dependency/config updates to enable pytest-benchmark. ChangesHost-side benchmark framework for cuda.compute
Assessment against linked issues
Comment |
There was a problem hiding this comment.
Actionable comments posted: 1
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: 457dee23-afbc-4cdb-a72e-afc227efe60a
📒 Files selected for processing (5)
.gitignorepython/cuda_cccl/benchmarks/compute/host/host_benchmark_cases.pypython/cuda_cccl/benchmarks/compute/host/test_host_pytest_benchmark.pypython/cuda_cccl/benchmarks/compute/pixi.tomlpython/cuda_cccl/pyproject.toml
|
pre-commit.ci autofix |
|
/ok to test a921939 |
|
pre-commit.ci autofix |
|
/ok to test 8172fd6 |
🥳 CI Workflow Results🟩 Finished in 1h 27m: Pass: 100%/51 | Total: 14h 07m | Max: 54m 34sSee results here. |
Description
closes #9028
closes #9431
Checklist