_mm256_rem_epu64 intrinsic not found with GCC 10.3.0...

c++ simd avx avx512

AVX512 compare to vector not to mask...

x86-64 avx512

AVX-512: _mm512_load vs. standard pointer casting?...

c intrinsics avx512

why glibc memcpy not choose avx512 version?...

c compiler-optimization glibc avx512

What is the most efficient way to clear a single or a few ZMM registers on Knights Landing?...

assembly avx xeon-phi avx512 knights-landing

AVX-512 - How to gather data from memory using assembly instruction?...

c++ assembly nasm simd avx512

AVX-512 - Debugging application with Intel SDE not working...

c++ debugging gdb intel avx512

How to get AVX512 in C#?...

c# simd avx avx512

SIMD: Bit-pack signed integers...

sse simd avx avx2 avx512

Is it possible to use ymm16 - ymm31 for AVX2 vpcmpeq{size} instructions?...

assembly x86-64 avx avx2 avx512

Is there way to automatically replace avx512 with avx2?...

gcc avx512

_mm256_loadu_epi64, _mm256_storeu_epi64 require avx512vl?...

c++ clang intrinsics avx2 avx512

How Are Registers X/Y/ZMM16-31 Encoded?...

assembly x86-64 disassembly machine-code avx512

What is the "correct" way to go from avx/sse masks to avx512 masks?...

c++ sse avx avx512

The usage of writemask k1 in AVX-512 VORPS?...

assembly x86-64 inline-assembly att avx512

GNU C inline asm input constraint for AVX512 mask registers (k1...k7)?...

c gcc assembly inline-assembly avx512

Why don't wider versions of VDPPD / VDPPS exist, like 512-bit bit?...

assembly x86 simd avx avx512

GCC inclusion of AVX512's "Fused Multiply Add" instructions when compiling for Cascade...

gcc intel cpu-architecture avx512 fma

Multiplying different types in AVX512...

c++ c sse avx avx512

What are the differences between the compress and expand instructions in AVX-512?...

assembly x86 simd avx512

Emulating shifts on 64 bytes with AVX-512...

simd avx512

Can AVX2-compiled program still use 32 registers of an AVX-512 capable CPU?...

x86 compiler-optimization cpu-architecture cpu-registers avx512

4-way bytewise interleave 4x 16-byte vectors from memory, with AVX512...

x86 x86-64 micro-optimization avx512

Find the INDEX of element having max. absolute value using AVX512 instructions...

c max instructions avx512

Best way to store 256 bit AVX vectors into unsigned long integers...

c vector avx avx2 avx512

Interleaved merging of 2 AVX-512 vector elements - C intrinsic...

c hpc intrinsics avx avx512

Fastest way to calculate a digit-sum for a large number (as a decimal string)...

c assembly sse intrinsics avx512

Fastest method to calculate sum of all packed 32-bit integers using AVX512 or AVX2...

c intrinsics avx avx2 avx512

How to achieve the effect of vpmovmskb on ZMM registers?...

assembly x86 bitmask avx512

In GNU C inline asm, what are the size-override modifiers for xmm/ymm/zmm for a single operand?...

c gcc sse inline-assembly avx512
