Skip to content

NVFP4: cache GEMM-swizzled weight scale factors across micro-batches#3093

Open
cael-ling wants to merge 2 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-weight-swizzle-cache
Open

NVFP4: cache GEMM-swizzled weight scale factors across micro-batches#3093
cael-ling wants to merge 2 commits into
NVIDIA:mainfrom
cael-ling:feature/nvfp4-weight-swizzle-cache

[pre-commit.ci] auto fixes from pre-commit.com hooks

f04d800
Select commit
Loading
Failed to load commit list.