On RDNA, MLIR attention is not enabled by default, so attention falls back to separate GEMM + scale + softmax + GEMM ops without any fusion. Enabling MLIR attention lets these ops be fused, which can significantly improve performance.
Motivation
On RDNA, MLIR attention is not enabled by default, causing attention to be computed as separate unfused ops: GEMM → scale → softmax → GEMM.
Technical Details
Enabling MLIR attention fuses the above ops into a single kernel for RDNA gfx10/11, which can significantly improve performance.
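For reference, a minimal NumPy sketch of the unfused sequence described above; the function name and shapes are illustrative and not part of this PR. Each step corresponds to a separate kernel in the unfused path, which is what the fused MLIR attention kernel avoids.

```python
import numpy as np

def attention_unfused(q, k, v):
    """Illustrative only: attention computed as separate ops
    (GEMM -> scale -> softmax -> GEMM), mirroring the unfused path.
    q, k, v have shape (seq_len, head_dim)."""
    scores = q @ k.T                                        # GEMM 1
    scores = scores / np.sqrt(q.shape[-1])                  # scale
    scores = scores - scores.max(axis=-1, keepdims=True)    # numerically stable softmax
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v                                      # GEMM 2
```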
Changelog Category
Add a CHANGELOG.md entry for any option other than Not Applicable.