Search code examples
Divide by 10 using bit shifts?...

mathbitmicro-optimizationlow-levelinteger-division

Read More
How exactly do partial registers on Haswell/Skylake perform? Writing AL seems to have a false depend...

assemblyx86intelcpu-architecturemicro-optimization

Read More
Why does mulss take only 3 cycles on Haswell, different from Agner's instruction tables? (Unroll...

cassemblyx86ssemicro-optimization

Read More
Converting nucleobase representation from ASCII to UCSC .2bit...

calgorithmbit-manipulationbioinformaticsmicro-optimization

Read More
Can packing variables or parameters into structures/unions introduce unforseen performance penalties...

cgccstructmicro-optimizationswar

Read More
Floating point division vs floating point multiplication...

c++floating-pointmicro-optimization

Read More
Controlling class member layout AND destructor order...

c++classconstructordestructormicro-optimization

Read More
JavaScript: Is the `if / else` statement faster than the conditional statement in?...

javascriptperformanceif-statementconditional-statementsmicro-optimization

Read More
Do most compilers optimize MATMUL(TRANSPOSE(A),B)?...

fortrangfortranintel-fortranmicro-optimization

Read More
Is x >= 0 more efficient than x > -1?...

c++optimizationmicro-optimization

Read More
Fastest way to find 16bit match in a 4 element short array?...

cx86-64micro-optimizationswar

Read More
Why XOR before SETcc?...

c++assemblyx86micro-optimization

Read More
In assembly, should branchless code use complementary CMOVs?...

assemblyx86micro-optimizationbranchlessconditional-move

Read More
Fast sign of integer in C...

cmicro-optimizationsigned-integer

Read More
How can I perform a branchless conditional arithmetic operation in C?...

cbit-manipulationmicro-optimizationtwos-complementbranchless

Read More
What are the pros and cons of int, unsigned int, uint_fastN_t, and int_fastN_t?...

cperformanceintegermicro-optimization

Read More
Why was the [[unlikely]] attribute used in this example?...

c++cpuc++20c++-chronomicro-optimization

Read More
What is the best way to set a register to zero in x86 assembly: xor, mov or and?...

performanceassemblyoptimizationx86micro-optimization

Read More
What is faster in C++: mod (%) or another counter?...

c++performanceassemblymicro-optimizationbranch-prediction

Read More
Advantage of using LEA over MOV for passing parameters in Assembly compiled from C++...

c++assemblyvisual-c++x86-64micro-optimization

Read More
How to write a custom exception class derived from std::invalid_argument?...

c++exceptionc-stringsstdstringmicro-optimization

Read More
Is there a faster algorithm for max(ctz(x), ctz(y))?...

c++algorithmrustbit-manipulationmicro-optimization

Read More
AVX2 code cannot be faster than gcc base optmization...

c++performancex86-64micro-optimizationavx2

Read More
How do I optimize a block copy and right shift + saturate to max=5, for Cortex-M3...

armcortex-mmicro-optimizationthumb

Read More
How do I reduce execution time and number of cycles for a factorial loop? And/or code-size?...

armcortex-mmicro-optimizationexecution-timethumb

Read More
Fastest polling loop - how can I trim 1 CPU cycle?...

assemblycortex-mmicro-optimization

Read More
Missing optimization: mov al, [mem] to bitfield-insert a new low byte into an integer...

cassemblyx86-64micro-optimization

Read More
Why do none of the major compilers optimize this conditional store that checks if the value is alrea...

c++compiler-optimizationmicro-optimization

Read More
uiCA assembly code check dosen't detect JCC erratum...

assemblyx86-64micro-optimization

Read More
Filling an AVX512 register with incrementing bytes...

assemblyoptimizationx86-64micro-optimizationavx512

Read More
BackNext