Search code examples
On x86-64, is the “movnti” or "movntdq" instruction atomic when system crash?...

x86-64atomiccpu-architecturessepersistent-memory

Read More
Multiplying and adding float numbers...

c++assemblyx86-64masmsse

Read More
Assembler invert XMM register...

assemblyssebitwise-not

Read More
Make previous NT stores visible to subsequent memory loads in other threads...

assemblyx86synchronizationssememory-barriers

Read More
How does the RMI Instruction Operand Encoding of ROUNDSS work?...

assemblyx86-64ssemachine-codeinstruction-encoding

Read More
Reverse a AVX register containing doubles using a single AVX intrinsic...

cssevectorizationsimdavx

Read More
What is the equivalent of v4sf and __attribute__ in Visual Studio C++?...

c++gccvisual-c++sseintrinsics

Read More
How to make an operation similar to _mm_extract_epi8 with non-immediate input?...

sseavx2

Read More
Do any JVM's JIT compilers generate code that uses vectorized floating point instructions?...

javafloating-pointjitssevectorization

Read More
Does FFTW determine SIMD version dynamically?...

simdsseavxfftwavx2

Read More
SSE-copy, AVX-copy and std::copy performance...

c++performancessesimdavx

Read More
access violation _mm_store_si128 SSE Intrinsics...

c++x86simdsseintrinsics

Read More
What is the purpose of the MoveMask for SSE and AVX...

.net-coref#x86sseavx

Read More
Can FP compares like SSE2 _mm_cmpeq_pd be used to compare 64 bit integers?...

simdsseavxsse2

Read More
Set XMM register via address location for X86-64...

c++assemblyx86-64sseasmjit

Read More
Storing two x86 32 bit registers into 128 bit xmm register...

assemblyx86simdsse

Read More
How does endianness work with SIMD registers?...

x86sseendiannesssimd

Read More
Is there a C++ function that returns exactly the value of the built-in CPU operation RSQRTSS for inv...

c++x86floating-pointssesqrt

Read More
Howto vblend for 32-bit integer? or: Why is there no _mm256_blendv_epi32?...

c++csseintrinsicsavx2

Read More
Dereference pointers in XMM register (gather)...

pointersssesimd

Read More
Extracting ints and shorts from a struct using AVX?...

c++x86ssesimdavx

Read More
load 32 bits from memory into xmm register...

sseinline-assemblyintrinsicssse2mmx

Read More
Intel Intrinsics guide - Latency and Throughput...

performancex86intelsseintrinsics

Read More
SIMD prefix sum on Intel cpu...

c++ssesimdprefix-sum

Read More
Is there a difference between SVML vs. normal intrinsic square root functions?...

c++intelsseintrinsicssse2

Read More
How to convert int 64 to int 32 with avx (but without avx-512)...

simdsseavx

Read More
int8 x uint8 matrix-vector product with column-major layout...

assemblyx86simdsseavx

Read More
Is the "throughput" listed by Intel per thread or per core?...

assemblyx86simdsseintrinsics

Read More
How do I enable SSE4.1 and SSE3 (but NOT AVX) in MSVC...

visual-c++ssesimdsse4

Read More
Are there unsigned equivalents of the x87 FILD and SSE CVTSI2SD instructions?...

assemblyfloating-pointssefloating-point-conversionx87

Read More
BackNext