Is it safe to compile one source with SSE2 another with AVX architecture?...


visual-c++ sse intrinsics avx

Shuffling a vector by number of bytes...


c++ x86 sse intrinsics avx

Transpose 4x4 int32 matrix using NEON...


assembly arm intrinsics neon

Extract the low bit of each bool byte in a __m128i? bool array to packed bitmap...


c++ gcc sse intrinsics

How to compile program with _mm_clflushopt function? error: inlining failed...


c gcc compilation intrinsics

How to implement an efficient _mm256_madd_epi8 dot-products of groups of four i8 elements?...


c++ x86 simd intrinsics avx2

Accumulating vector in __m128 using _mm_hadd_ps producing compile time error...


c intrinsics

Using Horizontal Neon intrinsics efficiently...


assembly inline-assembly arm64 intrinsics neon

How to convert 32-bit float to 8-bit signed char? (4:1 packing of int32 to int8 __m256i)...


c x86 simd intrinsics avx2

use c's `nmmintrin.h` in zig...


c intrinsics zig

using !Ref in second argument in SAM template...


amazon-web-services yaml aws-cloudformation intrinsics sam

Efficiently extract single double element from AVX-512 vector...


simd intrinsics avx512

Fastest way to implement _mm256_mullo_epi4 using AVX2...


c x86-64 intrinsics avx avx2

How to multiply-accumulate unsigned bytes into 32-bit elements without overflow with RISC-V extensio...


c vectorization simd intrinsics riscv

Usage of __AVX512F__ in Visual Studio for compiling code...


c++ visual-studio visual-c++ intrinsics avx512

Are there macros for SIMD instruction sets?...


c# simd intrinsics

Counter-intuitive results while playing with intrinsics...


c++ simd intrinsics avx2 microbenchmark

Testing for builtins/intrinsics...


c gcc intrinsics

Adding 3D vectors using SIMD intrinsics...


c++ vectorization simd intrinsics avx2

Why do compilers not coerce "n / 2.0" into "n * 0.5" if it's faster?...


c++ c compiler-optimization intrinsics

How to calculate 2x2 matrix multiplied by 2D vector using SSE intrinsics (32 bit floating points)? (...


c++ optimization matrix-multiplication sse intrinsics

Is there a list of all compiler intrinsic function for Delphi by version?...


delphi intrinsics

Extracting edges of AVX2 16x16 bitmatrix...


c bit-manipulation intrinsics avx2

"Intrinsics" possible on GPU on OpenGL?...


performance opengl gpu glsl intrinsics

bitpack ascii string into 7-bit binary blob using SIMD...


c ascii simd sse intrinsics

Do I need to use _mm256_zeroupper in 2021?...


c++ sse simd intrinsics avx

SSE divrem memory store requirements...


memory simd sse intrinsics icc

How would I define the __m256i data type in Ada?...


simd ada intrinsics avx2 gnat

In SIMD, SSE2, many instructions named as "_mm_set_epi8", "_mm_cmpgt_epi8" and so...


c++ simd sse intrinsics sse2

Optimizing find_first_not_of with SSE4.2 or earlier...


string optimization sse intrinsics sse4
