Search code examples
Using F# and SIMD to search for index of value...

f#simdavx

Read More
How can I extract a byte from __m256i AVX2 register into another __m256i register?...

csimdintrinsicsavxavx2

Read More
unexpected _mm256_shuffle_epi with __256i vectors...

c++intrinsicsavxavx2

Read More
Visual Studio debugger sets the upper half of AVX registers to zero...

c++visual-studiovisual-studio-debuggingavx

Read More
_mm256_rem_epu64 intrinsic not found with GCC 10.3.0...

c++simdavxavx512

Read More
Is there an AVX2 instruction (and intrinsic) to broadcast load a 16 bit value 16 times into an __m25...

c++sseintrinsicsavxavx2

Read More
No insert and extract for float/double in SSE and AVX?...

c++floating-pointssesimdavx

Read More
AVX-optimized addition of two vectors containing only 3 elements...

optimizationx86simdavx

Read More
What does memory 32bit Alignement constraint mean for AVX?...

c++avxavx2

Read More
What is the most efficient way to clear a single or a few ZMM registers on Knights Landing?...

assemblyavxxeon-phiavx512knights-landing

Read More
Writing a vector sum function with SIMD (System.Numerics) and making it faster than a for loop...

c#arraysperformancesimdavx

Read More
How to detect AVX2 support using gcc...

gccg++avxinstruction-setavx2

Read More
Illegal instruction from VS C++ on Windows...

c++assemblyvisual-c++avxillegal-instruction

Read More
Conditional move (cmov) for AVX vector registers based on scalar integer condition?...

assemblyx86avxavx2conditional-move

Read More
How do I know which AVX C functions are available on different processor models...

coptimizationx86avxcpuid

Read More
Pack (with saturation) __m256i of 16-bit values to __m128i of 8-bit values?...

x86simdavxavx2

Read More
How to get AVX512 in C#?...

c#simdavxavx512

Read More
Convert "__m256 with random-bits" into float values of [0, 1] range...

c++randomfloating-pointsimdavx

Read More
String length function is unstable...

csimdmemory-alignmentavxstrlen

Read More
Searching for the key using SIMD...

csimdavx

Read More
Load or shuffle a pair of floats with SIMD intrinsics for doubles?...

cssesimdintrinsicsavx

Read More
SIMD: Bit-pack signed integers...

ssesimdavxavx2avx512

Read More
cmpeqpd sometimes returns wrong values...

assemblyfloating-pointsseavxdenormal-numbers

Read More
Tensorflow for mac AVX...

pythontensorflowavx

Read More
First use of AVX 256-bit vectors slows down 128-bit vector and AVX scalar ops...

assemblyx86-64ssesimdavx

Read More
Is it possible to use ymm16 - ymm31 for AVX2 vpcmpeq{size} instructions?...

assemblyx86-64avxavx2avx512

Read More
_mm256_load_ps cause segmentation fault with google/benchmark in debug mode...

c++segmentation-faultsimdavxgoogle-benchmark

Read More
How do the AVX(2) gather instructions actually compute the fetch address?...

c++simdintrinsicsavxavx2

Read More
Fastest way to set __m256 value to all ONE bits...

bit-manipulationintrinsicsavxavx2

Read More
AVX2 set __mm256d variable to all ones...

cvectorizationintrinsicsavxavx2

Read More
BackNext