Search code examples
Add two vectors (uint64_t type) with saturation for each int8_t element...


assemblybit-manipulationx86-64saturation-arithmeticswar

Read More
Can packing variables or parameters into structures/unions introduce unforseen performance penalties...


cgccstructmicro-optimizationswar

Read More
Fastest way to find 16bit match in a 4 element short array?...


cx86-64micro-optimizationswar

Read More
SIMD-within-a-register version of min/max...


cassemblybit-manipulationarm64swar

Read More
Multiplication of two packed signed integers in one...


c++bit-manipulationstockfishswar

Read More
How does this algorithm to count the number of set bits in a 32-bit integer work?...


c++calgorithmhammingweightswar

Read More
How to write a SWAR comparison which puts 0xFF in a lane on matches?...


cbit-manipulationswar

Read More
SWAR byte counting methods from 'Bit Twiddling Hacks' - why do they work?...


cbit-manipulationswar

Read More
How 0x01010101 is equivalent to 1<<24 + 1<<16 + 1<<8 + 1...


binarybit-manipulationbitwise-operatorsbit-shiftswar

Read More
Can a register hold multiple values at a time?...


assemblyx86-64simdcpu-registersswar

Read More
Subtracting packed 8-bit integers in an 64-bit integer by 1 in parallel, SWAR without hardware SIMD...


c++cbit-manipulationsimdswar

Read More
Compare 64-bit integers by segments...


c++bit-manipulationswar

Read More
How does this color blending trick that works on color components in parallel work?...


optimizationlanguage-agnosticbit-manipulationcolor-blendingswar

Read More
BackNext