Skip to content

Releases: JuliaGPU/cuTile.jl

v0.2.1

10 Apr 06:35
9fcbc50

Choose a tag to compare

cuTile v0.2.1

Diff since v0.2.0

Merged pull requests:

v0.2.0

08 Apr 11:55

Choose a tag to compare

cuTile v0.2.0

Diff since v0.1.2

Breaking changes: see https://juliagpu.org/post/2026-04-08-cutile_0.2/

Merged pull requests:

Closed issues:

  • Alias-aware token threading for better parallelism (#1)
  • Port additional examples (#14)
  • Support for overflow options in Integer cases (#59)
  • Matmul broadcasting (#115)
  • TagBot: Manual intervention needed for releases (#131)
  • Layernorm regression: Token threading requires loop parallel store optimization (#146)
  • Broadcast involving scalar fails (#169)

v0.1.2

20 Mar 16:55
6ec5c8c

Choose a tag to compare

cuTile v0.1.2

Diff since v0.1.0

Merged pull requests:

Closed issues:

  • Launching kernels with arrays outside GPU (#98)
  • Compiler failures with nested while loops and runtime values in ct.full (#99)
  • TagBot trigger issue (#101)
  • Nested while loops produce wrong results (#102)
  • [Feature Request] Support for loops within the kernel (#103)
  • InvalidTerminatorError: dot-broadcast inside while loop causes yield type mismatch (#104)
  • Bitwise operations on tiles crash tileiras with ProcessExited(3) (#105)
  • Pre-Blackwell support (#109)
  • Architecture-specific configuration (#112)
  • transpose semantics on Tile (#114)
  • Splatted tile sizes to align with Julia functions (#117)
  • TagBot doesn't work (#120)
  • Downgrading deps makes tests fail (#130)

v0.1.1

20 Mar 16:59
f130734

Choose a tag to compare

What's Changed

Full Changelog: v0.1.0...v0.1.1

v0.1.0

20 Feb 22:55
d595fcc

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: https://github.com/JuliaGPU/cuTile.jl/commits/v0.1.0