-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is any plan to publish the manual like nvidia ptx spec? #133
Comments
Hello @alanzhai219 Thank you for reaching out and for your interest in our project. At the moment, we do not plan to refactor the documentation beyond the information presented in https://github.com/intel/vc-intrinsics/blob/master/GenXIntrinsics/include/llvm/GenXIntrinsics/Intrinsic_definitions.py. If you have specific questions about any of the supported intrinsics, please feel free to send them our way. Best regards, |
I mean why not publish a manual like https://docs.nvidia.com/cuda/pdf/ptx_isa_8.5.pdf. |
I totally agree that such a manual would be valuable. However, it currently sits low on our priority list. The reason is that most people use mainstream languages like DPC++ (Data Parallel C++) or DPC++ ESIMD (Explicit SIMD) extensions and do not use VC intrinsics directly. Those who use VC intrinsics to implement their programming languages usually know the architecture details very well and do not need detailed manuals. I hope that at some point we'll manage to provide detailed documentation. Until this happens, we can answer questions. Best regards, |
@ozhdikhin I see. It makes sense for now. |
btw, do you guys read the popular paper, deepseek? The kernel is written in ptx and the perf is incredible. That is why I highlight the GPU assembly. Please pay attention to it in the following gens. @ozhdikhin |
@ozhdikhin Should provide different API from high level to low level. |
@alanzhai219 Your feedback aligns with the strategic direction we are taking in the development of Intel's future GPU software stack. Stay tuned for updates. |
It is a bit hard to read and understand the syntax.
Do you have any plan to refactor and publish like nvidia PTX virtual instructions, which is easy to understand and invoke?
The text was updated successfully, but these errors were encountered: