Motivation
Show the current state of NPU benchmarks and set the direction for the next phase of performance optimization.
Evaluation criterion: each kernel's speedup ratio over the huggingface/torch baseline should be greater than 1.
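The criterion above can be sketched in a few lines. This is a minimal illustration, assuming the speedup ratio is defined as baseline latency divided by kernel latency (the issue does not spell out the formula); the function names and the sample latencies are hypothetical and are not measured results.

```python
def speedup_ratio(baseline_ms: float, kernel_ms: float) -> float:
    """Assumed definition: baseline latency / kernel latency (higher is better)."""
    if baseline_ms <= 0 or kernel_ms <= 0:
        raise ValueError("latencies must be positive")
    return baseline_ms / kernel_ms


def meets_criterion(baseline_ms: float, kernel_ms: float) -> bool:
    """True when the kernel is strictly faster than the baseline (ratio > 1)."""
    return speedup_ratio(baseline_ms, kernel_ms) > 1.0


# Hypothetical latencies in milliseconds: (baseline, kernel). Not real data.
samples = {"kernel_a": (2.0, 1.0), "kernel_b": (1.0, 1.25)}
for name, (baseline_ms, kernel_ms) in samples.items():
    ratio = speedup_ratio(baseline_ms, kernel_ms)
    status = "pass" if meets_criterion(baseline_ms, kernel_ms) else "needs optimization"
    print(f"{name}: speedup={ratio:.2f} ({status})")
```

Kernels that fall at or below a ratio of 1 under this definition would be the candidates for the next optimization phase.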
Env
Test machine environment
- platform: https://www.autodl.com/
- NPU: Atlas 900 A2 PoD(64G)
- HDK=25.2.0
- CANN=8.5.0
- CPU: 24 vCPU Kunpeng-920
- OS: Ubuntu 22.04
Software dependencies
- python=3.10.8
- Liger-Kernel=0.7.0, Commit ID: 781083b
- torch=2.6.0
- torch_npu=2.6.0
- torchvision=0.21.0
- triton-ascend=3.2.0
- transformers=5.2.0
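To reproduce the numbers it may help to confirm that the local environment matches the versions listed above. The following is a rough sketch using the standard-library `importlib.metadata`; the distribution names in `EXPECTED` (for example `liger-kernel` and `triton-ascend`) are assumptions about how the packages are registered with pip, and `check_versions` is a hypothetical helper, not part of any of these projects.

```python
from importlib.metadata import PackageNotFoundError, version

# Expected versions copied from the list above.
# The distribution names are assumptions and may differ on your system.
EXPECTED = {
    "liger-kernel": "0.7.0",
    "torch": "2.6.0",
    "torch_npu": "2.6.0",
    "torchvision": "0.21.0",
    "triton-ascend": "3.2.0",
    "transformers": "5.2.0",
}


def check_versions(expected):
    """Return {name: (wanted, found, ok)}; found is None if not installed."""
    report = {}
    for name, wanted in expected.items():
        try:
            found = version(name)
        except PackageNotFoundError:
            found = None
        # Prefix match so local build suffixes such as "2.6.0+cpu" still pass.
        ok = found is not None and found.startswith(wanted)
        report[name] = (wanted, found, ok)
    return report


if __name__ == "__main__":
    for name, (wanted, found, ok) in check_versions(EXPECTED).items():
        mark = "OK" if ok else "MISMATCH"
        print(f"{name}: expected {wanted}, found {found} [{mark}]")
```

This check covers only the Python packages; the HDK and CANN versions would need to be verified separately.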