-
Notifications
You must be signed in to change notification settings - Fork 671
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Codegen][GPU] Finish splitting NV intrinsics from AMD ones (#19853)
The split of `WMMA_*` enums into Nvidia and AMD variants was half finished. This completely splits the handling of each vendor. In the process, because concrete layouts for nvidia intrinsics is unimplemented, the only supported case is opaque layouts via SPIR-V. This required re-introducing `getMNKShape` per enum value rather than inferring it from the layout. This PR is effectively NFC, but unblocks enabling LLVMGPUTileAndFuse by default for matmuls.
- Loading branch information
Showing
3 changed files
with
75 additions
and
15 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters