Search code examples
bitpack ascii string into 7-bit binary blob using SIMD...

casciisimdsseintrinsics

Read More
Do I need to use _mm256_zeroupper in 2021?...

c++ssesimdintrinsicsavx

Read More
SSE divrem memory store requirements...

memorysimdsseintrinsicsicc

Read More
Get sum of values stored in __m256d with SSE/AVX...

c++optimizationsseavxavx2

Read More
In SIMD, SSE2,many instructions named as "_mm_set_epi8","_mm_cmpgt_epi8 " and so...

c++simdsseintrinsicssse2

Read More
Optimizing find_first_not_of with SSE4.2 or earlier...

stringoptimizationsseintrinsicssse4

Read More
SSE _mm_movemask_epi8 equivalent method for ARM NEON...

armsseneon

Read More
SSE/AVX: Choose from two __m256 float vectors based on per-element min and max absolute value...

sseintrinsicsavxavx512

Read More
Should you pass __m128 (and other register types) by reference or by copy?...

c++simdsseintrinsics

Read More
Why is masking needed before using a pshufb shuffle as a lookup table for nibbles?...

c++simdsseavxavx2

Read More
_mm_srli_si128 equivalent on altivec...

csimdssepowerpcaltivec

Read More
How to convert a hex float to a float in C/C++ using _mm_extract_ps SSE GCC instrinc function...

c++gccfloating-pointhexsse

Read More
X86: How to set lower half of xmm0 to 0, without affecting the upper half?...

assemblyx86ssesimdmicro-optimization

Read More
SSE: convert __m128 to float...

c++csse

Read More
Nibble shuffling with x64 SIMD...

x86-64simdsse

Read More
How to check if a CPU supports the SSE3 instruction set?...

c++sseinstruction-setavxcpuid

Read More
Fast vectorized conversion from RGB to BGRA...

copenglssesimdvectorization

Read More
Is there a way to force visual studio to generate aligned instructions from SSE intrinsics?...

visual-studiovisual-c++sseintrinsicsmemory-alignment

Read More
What is the Default addition Operator '+' of __m64...

cx86sseintrinsicsmmx

Read More
How much effort do you have to put in to get gains from using SSE?...

c++sse

Read More
How to get the number of unique elements of a simd vector in C...

csimdsse

Read More
How to compare two vectors using SIMD and get a single boolean result?...

assemblyx86ssesimd

Read More
sse4 packed sum between int32_t and int16_t (sign extend to int32_t)...

linuxx86-64intelsse

Read More
Find index of unaligned int or long in byte array using SIMD...

.netvectorizationsimdsse

Read More
Is there an equivalent of _mm_slli_si128(__m128i a, int num) for floats?...

cx86vectorizationsseintrinsics

Read More
Efficient transpose of 2D nibble matrix?...

cbit-manipulationsimdsseavx2

Read More
How to check inf for AVX intrinsic __m256...

c++csseintrinsicsavx

Read More
What is the difference between sse2neon and arm_neon.h?...

amazon-web-servicesssearm64neon

Read More
SSE/AVX: using float shuffles + casts as substitute for missing integer shuffle intrinsics?...

x86sseavx

Read More
How to convert an unsigned integer to floating-point in x86 (32-bit) assembly?...

assemblyx86sse

Read More
BackNext