What is the best way to set a register to zero in x86 assembly: xor, mov or and?...


performance assembly optimization x86 micro-optimization

What is faster in C++: mod (%) or another counter?...


c++ performance assembly micro-optimization branch-prediction

Advantage of using LEA over MOV for passing parameters in Assembly compiled from C++...


c++ assembly visual-c++ x86-64 micro-optimization

How to write a custom exception class derived from std::invalid_argument?...


c++ exception c-strings stdstring micro-optimization

Is there a faster algorithm for max(ctz(x), ctz(y))?...


c++ algorithm rust bit-manipulation micro-optimization

AVX2 code cannot be faster than gcc base optimization...


c++ performance x86-64 micro-optimization avx2

How do I optimize a block copy and right shift + saturate to max=5, for Cortex-M3...


arm cortex-m micro-optimization thumb

How do I reduce execution time and number of cycles for a factorial loop? And/or code-size?...


arm cortex-m micro-optimization execution-time thumb

Fastest polling loop - how can I trim 1 CPU cycle?...


assembly cortex-m micro-optimization

Missing optimization: mov al, [mem] to bitfield-insert a new low byte into an integer...


c assembly x86-64 micro-optimization

Why do none of the major compilers optimize this conditional store that checks if the value is alrea...


c++ compiler-optimization micro-optimization

uiCA assembly code check doesn't detect JCC erratum...


assembly x86-64 micro-optimization

Filling an AVX512 register with incrementing bytes...


assembly optimization x86-64 micro-optimization avx512

Assembly handwritten function slower than GCC compiled function...


assembly x86-64 cpu-architecture memory-alignment micro-optimization

Assembly function's data arrangement in data section...


assembly x86-64 cpu-architecture micro-optimization

ADD slower than ADC in the first step of a bigint multiply on Coffee Lake (Skylake)...


performance assembly x86 cpu-architecture micro-optimization

AND + CMP or SHR + CMP?...


c optimization cpu-architecture portability micro-optimization

x86-64 instruction to AND until zero?...


assembly bit-manipulation x86-64 micro-optimization

Is it possible to get the unsigned quotient and remainder at once in C?...


c performance division micro-optimization unsigned-integer

what is the purpose of using index caches in rigtorp's SPSCQueue...


queue cpu-architecture cpu-cache micro-optimization lock-free

Use two loop bodies or one (result identical)?...


c optimization caching cpu micro-optimization

Which Intel microarchitecture introduced the ADC reg,0 single-uop special case?...


performance assembly x86 intel micro-optimization

How to strip debug symbols for real in Xcode?...


xcode macos strip micro-optimization debug-symbols

"enter" vs "push ebp; mov ebp, esp; sub esp, imm" and "leave" vs "...


assembly x86 stack micro-optimization stack-frame

How to properly increment some array key, even if key needs to be created?...


php optimization micro-optimization

Mixing SSE with AVX128 for shorter instructions?...


assembly x86 sse avx micro-optimization

Is it useful to use VZEROUPPER if your program+libraries contain no SSE instructions?...


performance assembly x86 avx micro-optimization

C optimization: conditional store to avoid dirtying a cache line...


c caching cpu-cache micro-optimization libuv

Setting and clearing the zero flag in x86...


performance assembly x86 x86-64 micro-optimization

Cost of exception handlers in Python...


python performance exception micro-optimization
