Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[XPU][OptRed] Define triton_intel_gpu.simd_reduce and use in optimized transposed reduction #2907

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

victor-eds
Copy link
Contributor

Define SIMD transpose-reduce operation performing a SIMD reduction while transposing the implicit SIMD matrix. See description definition for further context.

Using this operation in the transpose reduction pass allows us to perform the optimization while not using SLM.

@victor-eds victor-eds requested review from whitneywhtsang, chengjunlu and a team December 3, 2024 14:26
@victor-eds victor-eds self-assigned this Dec 3, 2024
…zed transposed reduction

Define SIMD transpose-reduce operation performing a SIMD reduction while transposing
the implicit SIMD matrix. See description definition for further context.

Using this operation in the transpose reduction pass allows us to perform the optimization
while not using SLM.

Signed-off-by: victor-eds <[email protected]>
Co-authored-by: chengjunlu <[email protected]>
Signed-off-by: Victor Perez <[email protected]>
@victor-eds victor-eds force-pushed the in-reduction-transpose branch from 5e578e3 to 114add9 Compare December 3, 2024 14:50
@victor-eds
Copy link
Contributor Author

Something is off with the vISA generated. I'm looking into that. Delay reviews.

@victor-eds
Copy link
Contributor Author

IGC vISA parser fails to parse this when combined with DPAS instruction. Waiting for fix.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant