
[Stream] Implement SpecializeEncodings pass (1/n) #19502

Merged: 6 commits into iree-org:main on Jan 9, 2025

Conversation

hanhanW (Contributor) commented on Dec 17, 2024:

There are three major changes in this revision:

  • Introduce the AffinityAnalysisDialectInterface Stream dialect interface, used to fetch attributes that are defined by other dialects. In this revision, HAL implements the interface and can return whatever attribute is attached to the HAL::ExecutableTarget attributes. The main idea is that Stream does not need to depend on HAL to obtain the layout information (see the sketch after this list).
  • Add a cloneWithLayouts method to EncodingAttr. The encoding specialization pass uses it to resolve the layout requirements and record them in the layouts field. The other optional parameters are dropped because the layout is already resolved; this could even become a new Encoding dialect attribute, since it only describes the layout. The stream tensor ops do not need to know the op_type, element_types, and operand_index parameters; they only need the layout information, and the attribute should implement the corresponding interface method.
  • Partially implement the SpecializeEncodings pass. The responsibility of the pass is large, so I decided to implement it incrementally. This revision only implements the mechanism for updating stream tensor ops' encodings, and only the stream.tensor.sizeof op is supported; support for the other stream tensor ops can be added later. The executable duplication and the update of dispatch ops will be implemented in subsequent PRs.

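To make the shape of these pieces concrete, here is a rough C++ sketch assembled from the description above and from the snippet quoted in the review below. It is illustrative only: the class name HALAffinityAnalysisDialectInterface, the lambda parameters after Operation *op, the layoutAttrs output container, and the argument passed to cloneWithLayouts are assumptions, not the exact code that landed.

// Hedged sketch (namespaces/includes elided): HAL provides the Stream-side
// dialect interface so that Stream can ask which layouts an affinity requires
// without taking a build-time dependency on HAL.
class HALAffinityAnalysisDialectInterface final
    : public IREE::Stream::AffinityAnalysisDialectInterface {
public:
  using AffinityAnalysisDialectInterface::AffinityAnalysisDialectInterface;

  IREE::Stream::LayoutAttrSolverFn
  makeLayoutAttrSolver(ModuleOp moduleOp) const {
    return [=](IREE::Stream::AffinityAttr affinityAttr, Operation *op,
               llvm::SetVector<Attribute> &layoutAttrs) -> LogicalResult {
      // Assumed shape: resolve the HAL::ExecutableTarget attributes reachable
      // from the affinity and collect whatever layout attribute each one
      // advertises into layoutAttrs. Details elided in this sketch.
      return success();
    };
  }
};

// Hypothetical call site inside SpecializeEncodings: once the layouts for a
// stream tensor op (currently only stream.tensor.sizeof) are resolved, the
// encoding keeps only the layout information; op_type, element_types, and
// operand_index are dropped. The exact cloneWithLayouts signature is assumed.
auto resolvedEncoding = encodingAttr.cloneWithLayouts(layoutAttrs.getArrayRef());

The dialect-interface indirection is what keeps the Stream dialect free of a direct dependency on HAL while still letting the pass see the layouts that HAL's executable targets require.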
@hanhanW hanhanW requested review from lialan and Max191 and removed request for bjacob December 17, 2024 14:46
using AffinityAnalysisDialectInterface::AffinityAnalysisDialectInterface;
IREE::Stream::LayoutAttrSolverFn
makeLayoutAttrSolver(ModuleOp moduleOp) const {
return [=](IREE::Stream::AffinityAttr aff, Operation *op,
Collaborator commented:
Always use full names for things wherever possible - they give readers useful hints as to what the type is and how it is treated. Shortening to arbitrary forms makes it impossible to find/replace in codebases and is more difficult for non-native speakers (here and anywhere else you've shortened things).

Suggested change:
-  return [=](IREE::Stream::AffinityAttr aff, Operation *op,
+  return [=](IREE::Stream::AffinityAttr affinityAttr, Operation *op,

hanhanW (Author) replied:
I see, done.

compiler/src/iree/compiler/Dialect/HAL/IR/HALDialect.cpp — two resolved review threads (outdated)
// TODO(hanchung): We should use the default device in this case. However,
// it is not guaranteed that default device attribute will always be set in
// the IR. (Is the statement correct?)
auto affAttr = affinityOp.getAffinityAttr();
Collaborator commented:

As mentioned, use affinityAttr (here and elsewhere).

hanhanW (Author) replied:

Done. @benvanik does my comment above make sense? If so, I'm going to remove the "(Is the statement correct?)" part.

Collaborator replied:
At this point in the IR I don't believe there is a concept of a default device, so the whole TODO is not required. In a program with multiple functions there may be many routes to a function through the call graph, and we can't locally make decisions about the whole program like that.

@hanhanW hanhanW requested a review from lialan January 6, 2025 07:39
hanhanW (Author) commented on Jan 6, 2025:

@benvanik can you take a second look at this?

@hanhanW hanhanW force-pushed the specialize-encodings-1-n branch from a76fb9a to 86dd9b0 on January 7, 2025 05:08
Signed-off-by: hanhanW <[email protected]>
@hanhanW hanhanW enabled auto-merge (squash) January 9, 2025 03:19
@hanhanW hanhanW merged commit 02d145e into iree-org:main Jan 9, 2025
32 checks passed
@hanhanW hanhanW deleted the specialize-encodings-1-n branch January 9, 2025 03:39