[LAYOUTS] Implement generically getUniqueContigPerThread #5840
Conversation
LGTM
@@ -44,29 +44,30 @@ module attributes {"ttg.num-warps" = 8 : i32, "ttg.threads-per-warp" = 64 : i32}
  }
}

// -----
// // -----
@antiagainst This PR is breaking these two lit tests. I think it's coming from the changes in Allocation.cpp, but I have not tracked them down. I assume the fix would be something like using LinearEncodingAttr::getOrder instead of getOrder somewhere.
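To illustrate the distinction being suggested: an order read off an encoding attribute is just stored metadata, while an order derived from the linear layout itself reflects how register indices actually map to tensor coordinates. The sketch below is a toy Python model of that derivation, not Triton's actual C++ API; `register_bases` and `layout_order` are hypothetical names, and a real LinearLayout is considerably richer than a list of offset tuples.

```python
def layout_order(register_bases, rank):
    """Toy model: recover the fastest-to-slowest dimension order implied
    by a linear layout's register bases (hypothetical simplification of
    what LinearEncodingAttr::getOrder might compute from the layout).

    register_bases[i] is the tensor-coordinate offset contributed by
    register-index bit i, as a tuple with one entry per dimension.
    """
    order = []
    # Low register bits vary fastest, so the dimension each successive
    # basis touches first comes earlier in the order.
    for basis in register_bases:
        for d in range(rank):
            if basis[d] != 0 and d not in order:
                order.append(d)
    # Dimensions never touched by a register bit go last.
    order += [d for d in range(rank) if d not in order]
    return order
```

For example, bases `[(1, 0), (2, 0), (0, 1)]` make dimension 0 the fastest-varying, so the derived order is `[0, 1]`, regardless of what an attribute might claim.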
Would it be alright to merge this PR as-is and you guys could then send a forward-fix?
Feel free to land; we can fix later. @binarman FYI
@ThomasRaoux @Jokeren this one's ready for another review. I ended up having to fix a ton of things in the scan lowering, but hopefully all of them are reasonable enough.
I think a larger refactoring of reduce and scan is needed, as expected, though we should stop if there are more important tasks.
…ectorisation lengths
Looks good!
This allows vectorisation of global loads and smem accesses in some cases where we didn't use it before, as we now compute the order of the elements by looking at the actual LinearLayout associated with the given tensor shape, which is quite neat.
We end up touching a few things in the Scan lowering, as BlockedLayouts, when converted to LinearEncodings, may not have the same order of elems/threads/warps. This is a feature, not a bug, as it takes us closer to supporting arbitrary LinearEncodings within the tt.scan op.
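The idea behind the PR title, getUniqueContigPerThread, can be sketched the same way: the number of consecutive tensor elements a single thread holds in each dimension falls out of which low register bits map to unit strides, and that count bounds the usable vector width. This is a toy Python model under the same simplified layout representation as above; the function name mirrors the PR but the shape of the real Triton implementation is an assumption here.

```python
def unique_contig_per_thread(register_bases, rank):
    """Toy model: for each tensor dimension, count how many consecutive
    elements one thread holds contiguously (hypothetical simplification
    of getUniqueContigPerThread).

    register_bases[i] is the tensor-coordinate offset contributed by
    register-index bit i, as a tuple with one entry per dimension.
    """
    contig = []
    for d in range(rank):
        run = 1
        for basis in register_bases:
            # Bit i extends the contiguous run in dimension d only if it
            # contributes exactly the current run length there and zero
            # in every other dimension.
            expected = tuple(run if k == d else 0 for k in range(rank))
            if basis != expected:
                break
            run *= 2
        contig.append(run)
    return contig
```

With bases `[(1, 0), (2, 0), (0, 1)]`, each thread owns 4 contiguous elements along dimension 0 and 1 along dimension 1, so a lowering could emit 4-wide vector loads along dim 0; a layout whose first register bit jumps by a non-unit stride would cap the run at 1.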