Search code examples
x86 SIMD instructions 16 byte alignment in assembly (Without C intrinsics)...

assembly, x86-64, sse, simd, memory-alignment

Expand the lower two 32-bit floats of an xmm register to the whole xmm register...

assembly, x86, sse

Get member of __m128 by index?...

c++, clang, sse, simd, intrinsics

Writing a portable SSE/AVX version of std::copysign...

c++, x86-64, sse, simd, avx

SSE optimization of Gaussian blur...

c++, optimization, sse, simd, gaussianblur

How to calculate mod/remainder using SSE?...

assembly, sse, division

Most recent processor without support of SSSE3 instructions?...

x86, sse, simd, instruction-set

How to combine two __m128 values to __m256?...

c, x86, sse, simd, avx

Vectorization of modulo multiplication...

c++, algorithm, sse, simd, avx

Comparing quadwords in xmm...

assembly, x86, nasm, sse

Libc hypot function seems to return incorrect results for double type... why?...

c++, floating-point, sse, glibc, hypotenuse

Why move 32-bit register to stack then from stack to xmm register?...

assembly, x86, sse, att, micro-optimization

Set an XMM register to a repeating byte pattern (broadcast a constant byte)...

assembly, sse, micro-optimization, sse2

Multiplying different types in AVX512...

c++, c, sse, avx, avx512

Why does GCC or Clang not optimise reciprocal to 1 instruction when using fast-math...

c++, sse, compiler-optimization, simd, fast-math

How do I clamp __m128i signed integers into non-negative unsigned integers in SSE...

c++, max, sse, clamp

is it safe to use xmm registers to save the general-purpose ones?...

assembly, x86, sse, inline-assembly

Is there a shift 128/256 bits by 1 instruction?...

sse, simd, avx

Truncating an xmm floating-point register to a 64-bit register...

floating-point, x86-64, sse, truncate, floating-point-conversion

How to allocate 16byte memory aligned data...

c, memory, sse, icc

Migrating from XMM to YMM...

assembly, masm, sse, simd, avx

SSE _mm_dp_ps size result...

c, sse, intrinsics

Accumulate vector of integer with sse...

c++, vector, x86, sse, simd

Fast transposition of an image and Sobel Filter optimization in C (SIMD)...

c, optimization, sse, simd

SSE vector realign?...

sse, avx, avx2

Is it okay to mix legacy SSE encoded instructions and VEX encoded ones in the same code path?...

assembly, x86, sse, avx

What's So Difficult About `uint64_t`? (Conversion Assembly From `float`)...

assembly, floating-point, x86-64, sse, floating-point-conversion

How can I disable vectorization while using GCC?...

gcc, vectorization, sse, simd, auto-vectorization

How to compare __m128 types?...

x86, sse, simd

Simulating packusdw functionality with SSE2...

x86, sse, intrinsics, sse2, sse4
