Skip to content

Conversation

@degasus
Copy link
Contributor

@degasus degasus commented Jul 21, 2025

This file was completely disabled and never included, and so not tested and broken.

This patches fixes:

  • Missing include in the _isa.hpp
  • Typo in the include guard
  • Variable name within fast_cast
  • Requirements to avx512dq

I have to be honest: I have no clue if those implementations are correct. My patch "just" enables the previously dead code. Please see this PR more as a bug report than as a fully tested patch.

FYI: I've noticed this as to_float(int64) was very slow on MSVC. Please see: https://gcc.godbolt.org/z/q3MMn3q69
GCC is able to optimize xsimd's slow path to the single instruction in -O2, MSVC was not so lucky...

This file was completely disabled and never included, and so not tested and broken.

This patches fixes:
* Missing include in the _isa.hpp
* Typo in the include guard
* Variable name within fast_cast
* Requirements to avx512dq
#include "./xsimd_avx512f.hpp"
#endif

#if XSIMD_WITH_AVX512DQ
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI: XSIMD_WITH_AVX512CD is missing as well. However it does not define anything, but might yield the same issue again.

@serge-sans-paille serge-sans-paille merged commit bae9d19 into xtensor-stack:master Jul 21, 2025
63 checks passed
@serge-sans-paille
Copy link
Contributor

Thanks!

@degasus degasus deleted the avx512_dq branch July 22, 2025 05:47
@DiamonDinoia
Copy link
Contributor

Interesting. I did the same changes here :)
DiamonDinoia/xsimd@improving-avx512-swizzle

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants