Search code examples
Find position of the unique set bit in 32-bit number...

c++assemblyx86bit-manipulationintrinsics

Read More
SSE intrinsics atan2...

c++trigonometrysimdsseintrinsics

Read More
AVX512-FP16 intrinsics fails in release mode, works in debug...

visual-studiointrinsicsavx512

Read More
SIMD _mm_store_si128 | _mm_storeu_si128 don't storing correctly...

c++simdintrinsicsinstruction-set

Read More
Seg fault while using _mm256_i64gather_pd...

c++intrinsicsavxavx2

Read More
Difference between _mm_storeu_si128 and _mm_loadu_si128...

csseintrinsics

Read More
Is it safe to compile one source with SSE2 another with AVX architecture?...

visual-c++sseintrinsicsavx

Read More
Shuffling a vector by number of bytes...

c++x86sseintrinsicsavx

Read More
Transpose 4x4 int32 matrix using NEON...

assemblyarmintrinsicsneon

Read More
Extract the low bit of each bool byte in a __m128i? bool array to packed bitmap...

c++gccsseintrinsics

Read More
How to compile program with _mm_clflushopt function? error: inlining failed...

cgcccompilationintrinsics

Read More
How to implement an efficient _mm256_madd_epi8 dot-products of groups of four i8 elements?...

c++x86simdintrinsicsavx2

Read More
Accumulating vector in __m128 using _mm_hadd_ps producing compile time error...

cintrinsics

Read More
Using Horizontal Neon intrinsics efficiently...

assemblyinline-assemblyarm64intrinsicsneon

Read More
How to convert 32-bit float to 8-bit signed char? (4:1 packing of int32 to int8 __m256i)...

cx86simdintrinsicsavx2

Read More
use c's `nmmintrin.h` in zig...

cintrinsicszig

Read More
using !Ref in second argument in SAM template...

amazon-web-servicesyamlaws-cloudformationintrinsicssam

Read More
Efficiently extract single double element from AVX-512 vector...

simdintrinsicsavx512

Read More
Fastest way to implement _mm256_mullo_epi4 using AVX2...

cx86-64intrinsicsavxavx2

Read More
How to multiply-accumulate unsigned bytes into 32-bit elements without overflow with RISC-V extensio...

cvectorizationsimdintrinsicsriscv

Read More
Usage of __AVX512F__ in Visual Studio for compiling code...

c++visual-studiovisual-c++intrinsicsavx512

Read More
Are there macros for SIMD instruction sets?...

c#simdintrinsics

Read More
Counter-intuitive results while playing with intrinsics...

c++simdintrinsicsavx2microbenchmark

Read More
Testing for builtins/intrinsics...

cgccintrinsics

Read More
Adding 3D vectors using SIMD intrinsics...

c++vectorizationsimdintrinsicsavx2

Read More
Why do compilers not coerce "n / 2.0" into "n * 0.5" if it's faster?...

c++ccompiler-optimizationintrinsics

Read More
How to calculate 2x2 matrix multiplied by 2D vector using SSE intrinsics (32 bit floating points)? (...

c++optimizationmatrix-multiplicationsseintrinsics

Read More
Is there a list of all compiler intrinsic function for Delphi by version?...

delphiintrinsics

Read More
Extracting edges of AVX2 16x16 bitmatrix...

cbit-manipulationintrinsicsavx2

Read More
"Intrinsics" possible on GPU on OpenGL?...

performanceopenglgpuglslintrinsics

Read More
BackNext