Search code examples
Reference implementation of vrecpeq_f32 intrinsic?...

c++simdintrinsicsneon

Read More
Howto vblend for 32-bit integer? or: Why is there no _mm256_blendv_epi32?...

c++csseintrinsicsavx2

Read More
load 32 bits from memory into xmm register...

sseinline-assemblyintrinsicssse2mmx

Read More
Intel Intrinsics guide - Latency and Throughput...

performancex86intelsseintrinsics

Read More
How to read the "Intel Intrinsics Guide"?...

intelsimdintrinsics

Read More
Is there a difference between SVML vs. normal intrinsic square root functions?...

c++intelsseintrinsicssse2

Read More
Is the "throughput" listed by Intel per thread or per core?...

assemblyx86simdsseintrinsics

Read More
What's the difference between logical SSE intrinsics?...

cssesimdintrinsicssse2

Read More
128-bit division intrinsic in Visual C++...

visual-c++intrinsicsinteger-division128-bit

Read More
Does _mm_stream_load_si128 (movntdqa) modify the memory its argument points to?...

cassemblyx86sseintrinsics

Read More
How to interleave 3 float vectors into an array with AVX intrinsics C++...

c++simdintrinsicsavxavx2

Read More
fill a zmm from two ymms in C...

cintrinsicsavx2avx512

Read More
Finding Next Ascii Space With _mm_cmpeq_epi8 Returning 0...

csseintrinsics

Read More
_BitScanForward _BitScanForward64 missing (VS2017) Snappy...

c++visual-c++x86bit-manipulationintrinsics

Read More
can I assign the result of intrinsic that returns __m128i to variable of the type__m128i_u?...

simdsseintrinsicssse2

Read More
How can I extract a byte from __m256i AVX2 register into another __m256i register?...

csimdintrinsicsavxavx2

Read More
unexpected _mm256_shuffle_epi with __256i vectors...

c++intrinsicsavxavx2

Read More
Intrinsic definition in magma...

importintrinsicsmagma-ca

Read More
my intrinsic function in getting the dot product of an int array is slower than the normal code, wha...

c++cpusseintrinsicsdot-product

Read More
how to debug a _mm_mul_ps function?...

c++segmentation-faultssesimdintrinsics

Read More
Why does inverting the parameters to a CMPGT comparison function work as a CMPLT?...

c++sseintrinsicsavx2

Read More
Are there any common fixed-point intrinsics?...

x86-64divisionintrinsicsfixed-pointsqrt

Read More
What doest `vaddhn_high_s16` actually do?...

c++simdintrinsicsarm64neon

Read More
Operands for VPCMPB...

assemblyx86-64intrinsicsavx512

Read More
How _mm_prefetch works?...

assemblycachingsseintrinsicsprefetch

Read More
AVX-512: _mm512_load vs. standard pointer casting?...

cintrinsicsavx512

Read More
Is there an AVX2 instruction (and intrinsic) to broadcast load a 16 bit value 16 times into an __m25...

c++sseintrinsicsavxavx2

Read More
Check XMM register for all zeroes...

c++ssesimdintrinsics

Read More
How to load 16 bytes of memory into a Rust __m128i?...

rustssesimdintrinsics

Read More
How to combine constexpr and vectorized code?...

c++openmpconstexprintrinsics

Read More
BackNext