Search code examples
Why does the compiler not always optimize away local variables?...


c++ gcc compiler-optimization micro-optimization

Optimize Binary Search Algorithm...


c algorithm binary micro-optimization

Inlining of a recursive function...


c++ recursion compiler-optimization micro-optimization inlining

Efficient division operator for 3D vectors...


c++ operator-overloading division sfinae micro-optimization

Why _umul128 works slower than scalar code for mul128x64x2 function?...


c++ x86 simd avx micro-optimization

Missing latency in instruction tables...


assembly x86 micro-optimization

Why does instruction cache alignment improve performance in set associative cache implementations?...


cpu-architecture memory-alignment cpu-cache micro-optimization

What's the most concise way to reverse a string using x86 or x86_64 assembly?...


assembly x86-64 micro-optimization code-size

Backing field vs "value" keyword in property setter...


c# properties micro-optimization backing-field

Most efficient way to find the greatest number not greater than A, which is divisible by B...


c++ math x86 micro-optimization integer-division

Why are DateTime.Now DateTime.UtcNow so slow/expensive...


c# performance micro-optimization

Fastest implementation of simple, virtual, observer-sort of, pattern in c++?...


c++ enums virtual-functions dispatch micro-optimization

Passing as `const&` lightweight objects...


c++ pass-by-reference pass-by-value micro-optimization

Which is generally faster to test for zero in x86 ASM: "TEST EAX, EAX" versus "TEST A...


performance assembly x86 micro-optimization

Redundant value copying in assembly?...


assembly optimization x86 micro-optimization

Optimization of list comprehension with string matching...


regex python-3.x optimization string-matching micro-optimization

What C/C++ compiler can use push pop instructions for creating local variables, instead of just incr...


c++ assembly x86 compiler-optimization micro-optimization

x86 Multiplication with 3: IMUL vs SHL + ADD...


assembly x86 x86-64 intel micro-optimization

Any difference in the performance of these two Java segments?...


java micro-optimization

C++: Performance impact of if inside loops...


c++ micro-optimization premature-optimization

Is it possible to convince clang to auto-vectorize this code without using intrinsics?...


vectorization simd llvm-clang micro-optimization avx2

fastest way to convert two-bit number to low-memory representation...


c++ performance micro-optimization

Loading an entire cache line at once to avoid contention for multiple elements of it...


c++ multithreading x86 micro-optimization false-sharing

What is the difference between _mm_movehdup_ps and _mm_shuffle_ps in this case?...


x86 sse intrinsics micro-optimization sse3

Comparing two values in the form (a + sqrt(b)) as fast as possible?...


c++ optimization algebra micro-optimization sqrt

Are there any builtins to tell compiler whether a branch is predictable?...


c++ c gcc llvm-clang micro-optimization

Intel prefixes instructions, checking optimisations problems...


optimization x86 intel disassembly micro-optimization

Neon 64 bit aarch: compare vector to zero...


assembly neon arm64 micro-optimization

Is movsbl near ret good for performance?...


gcc cpu x86-64 micro-optimization

Conditional function call without using other jumps?...


assembly x86 nasm micro-optimization
