-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing sharding specs when annotating sharding over views #8662
Comments
I thought @yaochengji already fixed this. |
I pointed to #8427 as possibly being the same underlying issue - but that one was without functionalization. The failure even happens with (smaller) cases, but I don't see why it should end up causing an issue on XLA (let me know if this is expected - since this is a minimal repro example that may not capture a real use case). In the end, we're trying to reconcile different sharding specs before and after the reshape/view, but we see this with Llama 3 when trying to add custom sharding at different points (e.g. SP):
following with a reshape on a subsequent custom sharding:
|
🐛 Bug
The HLO instruction for the custom sharding call is missing the sharding specs, leading to
has_sharding
failures on XLA:This issue was earlier identified in #8427, but with manual sharding. @JackCaoG did some investigation, but we didn't entirely RCA the issue yet. The issue can be minimally reproduce with the mark sharding as well, and we observe the same problem when adding any custom sharding prior to the input layer normalization for Llama3.
To Reproduce
Expected behavior
We expect the appropriate sharding spec to be present in the custom sharding call, namely (for 1) above), to include:
Environment
The text was updated successfully, but these errors were encountered: