-
Notifications
You must be signed in to change notification settings - Fork 293
Support compress&expand in avx512vbmi2 #1102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@serge-sans-paille |
this caused by _mm512_shuffle_epi8 for batch swizzle in avx512bw which is incorrect. use _mm512_permutexvar_epi8 in avx512vbmi instead. |
|
@serge-sans-paille , |
|
@junparser #1104 should do the trick |
Also, after this commit, would you like to add one more |
|
|
Thanks, now this pr passes avx512vbmi2. Also is it possible to add something like |
|
|
||
| using all_x86_architectures = arch_list< | ||
| avx512vnni<avx512vbmi>, avx512vbmi, avx512ifma, avx512pf, avx512vnni<avx512bw>, avx512bw, avx512er, avx512dq, avx512cd, avx512f, | ||
| avx512vnni<avx512vbmi2>, avx512vbmi2, avx512vbmi, avx512ifma, avx512pf, avx512vnni<avx512bw>, avx512bw, avx512er, avx512dq, avx512cd, avx512f, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you motivate this change? Why would composition between avx512 VNNI be more legit with VMBI2 rather than VBMI?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I made this change based on https://en.wikichip.org/wiki/x86/avx512_vnni. The table shows that all of the arch have vbmi2 as well as vnni.
No description provided.