What does the [Intrinsic] attribute in C# do?...

c# .net .net-core intrinsics

SIMD instructions on contiguous iterators...

c++ iterator sse simd intrinsics

Why does gcc -O3 handle avx256 compare intrinsic differently than gcc -O0 and clang?...

c gcc simd intrinsics avx

How can I gather single bytes with AVX512 intrinsics, given a vector of int offsets?...

c sse simd intrinsics avx512

How to extend an int32x2_t to an int32x4_t with NEON intrinsics on clang/AArch64 when you don't ca...

arm simd intrinsics arm64 neon

What is the difference between loadu/lddqu and assignment operator?...

c sse simd intrinsics

Does an aborted xbegin transaction restore the stack context that existed at the xbegin start?...

c++ x86 intrinsics intel-tsx

Cast from double to __m128...

c++ assembly sse inline-assembly intrinsics

Parallel bit deposit / parallel bit extract on intel compiler/LLVM?...

gcc clang intrinsics icc bmi

Leading zeros calculation with intrinsic function...

arm bit-manipulation windows-ce intrinsics leading-zero

What is the difference between _mm_set1_ps and _mm_set_ps1?...

c sse intrinsics

Matrix-Vector and Matrix-Matrix multiplication using SSE...

c++ sse matrix-multiplication intrinsics vector-multiplication

How to take the high part of __m256...

c pointers assembly intrinsics avx

Fastest way to initialize a __m128i constant with intrinsics?...

c visual-c++ sse intrinsics micro-optimization

Why and when to use __noop?...

c++ visual-c++ intrinsics

_mm256_movemask_epi8 to uint64_t...

c++ visual-c++ type-conversion intrinsics sign-extension

AVX: "to 1 if not zero"...

c++ sse intrinsics avx

How to sum __m256 horizontally?...

sse vectorization intrinsics avx

Accessing 32bit from 64bit using ARM Neon intrinsics...

c arm simd intrinsics neon

Vectorizing a loop over float x,y,z arrays calculating length and differences using SSE Intrinsics...

c optimization vectorization sse intrinsics

How to add an AVX2 vector horizontally 3 by 3?...

c x86 simd intrinsics avx2

Summing 8-bit integers in __m512i with AVX intrinsics...

c x86 simd intrinsics avx

Dividing packed 16-bit integer with mask using AVX512 or SVML intrinsics...

c intrinsics avx avx512

Converting packed 64-bit integers to packed 8-bit integers with signed saturation using AVX512...

c intrinsics avx avx512

clflush to invalidate cache line via C function...

c performance x86 intrinsics cpu-cache

Do I get a performance penalty when mixing SSE integer/float SIMD instructions...

c assembly sse simd intrinsics

c++ AVX512 intrinsic equivalent of _mm256_broadcast_ss()?...

c++ intel intrinsics avx2 avx512

cmake CheckSymbolExists for intrinsic...

cmake intel intrinsics

How to enable intrinsic functions from the preprocessor...

c gcc bit-manipulation intrinsics instruction-set

Intel store instructions on deliberately overlapping memory regions...

c++ intrinsics avx
