Skip to content

SME2/SVE/NEON heuristic - ACL#1294

Closed
damdoo01-arm wants to merge 1 commit into
ARM-software:mainfrom
damdoo01-arm:damdoo01/sme2_neon_heuristic
Closed

SME2/SVE/NEON heuristic - ACL#1294
damdoo01-arm wants to merge 1 commit into
ARM-software:mainfrom
damdoo01-arm:damdoo01/sme2_neon_heuristic

Conversation

@damdoo01-arm

Copy link
Copy Markdown

Title:
Allow runtime masking of SVE/SVE2 CPU feature exposure

Description:

This PR adds runtime control over whether ACL exposes SVE/SVE2 capabilities through arm_compute::CPUInfo. For full context, please refer to the associated ArmNN PR at: ARM-software/armnn#820

Problem statement:

ArmNN needs a way to steer ACL away from SME/SME2 and, in some cases, SVE/SVE2 kernel families when graph-level shape heuristics indicate that those paths regress performance. The regression is most visible on SME2-capable hardware under high thread count, where the hardware/resource pressure around SME2 packing can dominate the expected matmul acceleration for some Geekbench AI shapes.

High-level approach:

ACL already had runtime masking for SME/SME2 via set_sme_allowed(). This PR adds equivalent SVE/SVE2 masking:

void CPUInfo::set_sve_allowed(bool is_allowed);
When disabled, ACL reports SVE/SVE2 and related features as unavailable through:

has_sve()
has_sve2()
has_svebf16()
has_svei8mm()
has_svef32mm()
get_isa()
This lets ArmNN apply its graph-level policy while keeping ACL’s existing kernel selection mechanisms intact.

Relationship to ArmNN PR:

The ArmNN PR emits CpuAcc options such as:

SmeEnabled=false
SveEnabled=true
or:

SmeEnabled=false
SveEnabled=false
The ACL PR provides the underlying mechanism that makes those options affect runtime kernel selection.

@damdoo01-arm damdoo01-arm changed the title Damdoo01/sme2 neon heuristic SME2/SVE/NEON heuristic Jun 12, 2026
@damdoo01-arm damdoo01-arm changed the title SME2/SVE/NEON heuristic SME2/SVE/NEON heuristic - ArmNN Jun 12, 2026
@damdoo01-arm damdoo01-arm changed the title SME2/SVE/NEON heuristic - ArmNN SME2/SVE/NEON heuristic - ACL Jun 12, 2026
Expose runtime controls in CPUInfo so clients can mask SME/SME2 and SVE capabilities when selecting CPU kernels. This lets higher-level frameworks steer ACL away from ISA paths that should not be used for a graph while preserving default hardware-based selection when no override is supplied.

Full context in the ArmNN PR: ARM-software/armnn#820

Signed-off-by: Damien Dooley <damien.dooley@arm.com>

Change-Id: I602cebdd58942930d248948788bfac9e2be56474
@damdoo01-arm damdoo01-arm force-pushed the damdoo01/sme2_neon_heuristic branch from 992d402 to 6d46ae5 Compare June 16, 2026 12:10

@gunes-arm gunes-arm left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we change the commit message as:

feat: Add experimental SME/SVE runtime selection controls

And, also make the PR title the same?

*/
bool has_sme_b16f32() const;
/** Sets whether SVE and SVE2 implementations are allowed to be selected at runtime.
*

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add the following into all?

@experimental This API is under development and may change or be removed without notice in future releases.

@damdoo01-arm damdoo01-arm deleted the damdoo01/sme2_neon_heuristic branch June 16, 2026 13:32
@damdoo01-arm

Copy link
Copy Markdown
Author

Superseded by #1295.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants