Search code examples
Is this a missed optimization in GCC, loading a 16-bit integer value from .rodata instead of an imm...


c gcc x86-64 compiler-optimization micro-optimization

How is a critical path formed when there is a data dependency between loop iterations while a CPU ...


performance assembly x86-64 cpu-architecture micro-optimization

Why is POP slow when using register R12?...


performance x86 intel cpu-architecture micro-optimization

Is it more efficient to multiply within the address displacement or outside it?...


assembly optimization x86 micro-optimization addressing-mode

what is faster: in_array or isset?...


php performance micro-optimization

What is faster in Python, "while" or "for xrange"...


python micro-optimization

Should I use Java's String.format() if performance is important?...


java string performance string-formatting micro-optimization

How to unroll a loop of a dot product in mips after re-ordering instructions?...


assembly mips cpu-architecture micro-optimization loop-unrolling

Most compact way to test for a negative number in x86 assembly?...


assembly x86 micro-optimization

test $x,%dil vs. test $x,%edi...


assembly optimization x86-64 att micro-optimization

Why should code be aligned to even-address boundaries on x86?...


assembly x86 memory-alignment micro-optimization

Any possible code that can flip a bit/integer/bool between 0 and 1 in single CPU instruction...


c++ c assembly x86 micro-optimization

A checklist for Spacy optimization?...


optimization nlp spacy micro-optimization

Delete a node from a linked list in C...


c optimization data-structures linked-list micro-optimization

Perf record after my code reaches a certain point?...


x86-64 perf micro-optimization

X86: How to set lower half of xmm0 to 0, without affecting the upper half?...


assembly x86 sse simd micro-optimization

Passing a struct with constant data to a function in C...


c gcc struct compiler-optimization micro-optimization

Testing an Algorithm's speed. How?...


c performance profiling benchmarking micro-optimization

AVX512BW: handle 64-bit mask in 32-bit code with bsf / tzcnt?...


assembly x86 32-bit micro-optimization avx512

For what values would accessing single bytes for XOR logic work faster than multiplication to find i...


c performance x86 micro-optimization

How can I see which i686 instructions are faster...


assembly x86 cpu-architecture osdev micro-optimization

What code is more CPU expensive: while(*p) or while(i--)?...


c micro-optimization

Using SIMD/AVX/SSE for tree traversal...


performance assembly simd micro-optimization avx

' ... != null' or 'null != ....' best performance?...


java performance micro-optimization

LEA vs MOV + INC for incrementing a variable and storing it in another one?...


assembly x86-64 intel micro-optimization

What's the most efficient way to swap 4 16-bit integers on a 64-bit processor?...


c++ assembly micro-optimization

Is there a way to "unfetch" a cache line?...


caching compiler-optimization cpu-cache micro-optimization prefetch

Avoiding the overhead of C# virtual calls...


c# virtual-functions micro-optimization

Is it more efficient to return a string literal or a const string in each subtype in Java?...


java memory micro-optimization

Why is strtolower slightly slower than strtoupper?...


php c performance micro-optimization microbenchmark
