-
-
Notifications
You must be signed in to change notification settings - Fork 245
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Develop Mac versions of solutions for a few labs #60
Comments
You can try to use "sse2neon" |
Thanks! Good idea. |
Hi @dendibakh, I can confirm that using sse2neon to solve the ["core_bound"]["compiler_intrinsics_1"] does work, albeit it is a bit slower than writing pure ARM Neon code, due to the differences in the architectures and the more instructions need to translate from SSE to NEON using the same x86 algorithms. You can check the CI Job as well as my branch with commits . Once again, thank you for the excellent work! |
Ups, I did not mean to close this. Denis, can you please re-open this? |
This is nice to know! I haven't used sse2neon before.
I can't find your NEON implementation. Can you please share it?
You're welcome! :) |
Hey @dendibakh This is it! implementation. You can look at the history as well as see that I added the sse2neon.h in the compiler_intrinsics_1 folder. You can add that as a dependency that you would pull automatically on ARM devices compatible with Neon. Additionally, here is the link to the CI job for M1 Mac, I had to enable the CI to run on M1 for this lab, I did that here. P.S.: I am sorry for temporarily closing the issue again. The GitHub interface is not friendly enough for me. Kind regards, |
I thought you said you wrote NEON instrinsics yourself without using sse2neon library, no? |
Hey, @dendibakh, I am sorry for the misunderstanding. However, I did not say that. I only said that I confirmed that it could be done using the sse2neon library. Kind regards, |
Ok, got it, no worries. Thanks for sharing your experiments. |
Currently the following labs don't have solutions for Mac M1 platform:
This prevents automated benchmarking of their speedups in CI.
The text was updated successfully, but these errors were encountered: