Search code examples
What are the best instruction sequences to generate vector constants on the fly?...


assemblyx86ssesimdavx

Read More
Do I need to use _mm256_zeroupper in 2021?...


c++ssesimdintrinsicsavx

Read More
SSE divrem memory store requirements...


memorysimdsseintrinsicsicc

Read More
Get sum of values stored in __m256d with SSE/AVX...


c++optimizationsseavxavx2

Read More
In SIMD, SSE2,many instructions named as "_mm_set_epi8","_mm_cmpgt_epi8 " and so...


c++simdsseintrinsicssse2

Read More
Optimizing find_first_not_of with SSE4.2 or earlier...


stringoptimizationsseintrinsicssse4

Read More
SSE _mm_movemask_epi8 equivalent method for ARM NEON...


armsseneon

Read More
SSE/AVX: Choose from two __m256 float vectors based on per-element min and max absolute value...


sseintrinsicsavxavx512

Read More
Should you pass __m128 (and other register types) by reference or by copy?...


c++simdsseintrinsics

Read More
Why is masking needed before using a pshufb shuffle as a lookup table for nibbles?...


c++simdsseavxavx2

Read More
Classification of x86 instructions according to floating point rounding mode sensitivity?...


assemblyfloating-pointx86-64sserounding-error

Read More
_mm_srli_si128 equivalent on altivec...


csimdssepowerpcaltivec

Read More
How to convert a hex float to a float in C/C++ using _mm_extract_ps SSE GCC instrinc function...


c++gccfloating-pointhexsse

Read More
X86: How to set lower half of xmm0 to 0, without affecting the upper half?...


assemblyx86ssesimdmicro-optimization

Read More
SSE: convert __m128 to float...


c++csse

Read More
Nibble shuffling with x64 SIMD...


x86-64simdsse

Read More
How to check if a CPU supports the SSE3 instruction set?...


c++sseinstruction-setavxcpuid

Read More
Fast vectorized conversion from RGB to BGRA...


copenglssesimdvectorization

Read More
Is there a way to force visual studio to generate aligned instructions from SSE intrinsics?...


visual-studiovisual-c++sseintrinsicsmemory-alignment

Read More
What is the Default addition Operator '+' of __m64...


cx86sseintrinsicsmmx

Read More
How much effort do you have to put in to get gains from using SSE?...


c++sse

Read More
How to get the number of unique elements of a simd vector in C...


csimdsse

Read More
How to compare two vectors using SIMD and get a single boolean result?...


assemblyx86ssesimd

Read More
sse4 packed sum between int32_t and int16_t (sign extend to int32_t)...


linuxx86-64intelsse

Read More
Find index of unaligned int or long in byte array using SIMD...


.netvectorizationsimdsse

Read More
Is there an equivalent of _mm_slli_si128(__m128i a, int num) for floats?...


cx86vectorizationsseintrinsics

Read More
Efficient transpose of 2D nibble matrix?...


cbit-manipulationsimdsseavx2

Read More
How to check inf for AVX intrinsic __m256...


c++csseintrinsicsavx

Read More
What is the difference between sse2neon and arm_neon.h?...


amazon-web-servicesssearm64neon

Read More
SSE/AVX: using float shuffles + casts as substitute for missing integer shuffle intrinsics?...


x86sseavx

Read More
BackNext