Skip to content

[RFC, NPU] NPU benchmark - as a baseline for follow-up tasks. #1159

@zheliuyu

Description

@zheliuyu

Motivation

Show the current state of NPU benchmarks. It also sets the direction for the next phase of performance optimization tasks.

Evaluation criteria: The speedup ratio of each kernel compared to huggingface/torch should be greater than 1.

Env

Test machine environment

  • platform: https://www.autodl.com/
  • NPU: Atlas 900 A2 PoD(64G)
  • HDK=25.2.0
  • CANN=8.5.0
  • CPU: 24 vCPU Kunpeng-920
  • OS: ubuntu22.04

Software dependencies

  • python=3.10.8
  • Liger-Kernel=0.7.0, Commit ID: 781083b
  • torch=2.6.0
  • torch_npu=2.6.0
  • torchvision==0.21.0
  • triton-ascend=3.2.0
  • transformers=5.2.0

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions