Search code examples
Why doesn’t Clang use vcnt for __builtin_popcountll on AArch32?...

armclangbit-manipulationmicro-optimizationneon

Read More
C++ different using declarations for different concepts...

c++optimizationc++20micro-optimizationc++-concepts

Read More
Assembly function address table and data under the function or in data section...

assemblyx86x86-64micro-optimizationstatic-data

Read More
Is reading the `length` property of an array really that expensive an operation in JavaScript?...

javascriptarraysmicro-optimization

Read More
When source registers in avx instruction can be reused...

assemblycpu-architecturesimdavxmicro-optimization

Read More
Cost of a 64bits jump, always 10-22 cycles the first time?...

x86x86-64cpu-architecturemicro-optimizationbranch-prediction

Read More
Why does clang's epilogue use `add $N, %rsp` instead of `mov %rbp, %rsp` to restore `%rsp`?...

assemblyclangx86-64micro-optimization

Read More
PHP: What is the fastest and easiest way to get the last item of an array?...

phpmicro-optimization

Read More
Is not having local functions a micro optimisation?...

javascriptmicro-optimization

Read More
Why dependency in a loop iteration can't be executed together with the previous one...

performanceassemblyx86micro-optimizationmicro-architecture

Read More
ARM Cortex M0+: How to use "Branch if Carry" instructions in C-code?...

cassemblyarmcompiler-optimizationmicro-optimization

Read More
Optimizing a C function call using 64-bit MASM...

cassemblyx86-64micro-optimization

Read More
Why is movl preferred to movb when translating a C downcast from unsigned int to unsigned char?...

cassemblygccx86micro-optimization

Read More
can the mips pseudo-instruction la be replace by a single MIPS instruction?...

assemblymipsmicro-optimizationinstructions

Read More
Adding a byte from memory to the AX register...

assemblyx86-16masmmicro-optimizationzero-extension

Read More
Efficient modulo-255 computation...

calgorithmassemblybit-manipulationmicro-optimization

Read More
How to optimize for writes to memory in hot loop...

assemblyx86x86-64micro-optimizationradix-sort

Read More
Are these the smallest possible x86 macros for these stack operations?...

assemblyx86jitmicro-optimizationstack-machine

Read More
Why does breaking the "output dependency" of LZCNT matter?...

performanceassemblyx86cpu-architecturemicro-optimization

Read More
Fastest way to strip all non-printable characters from a Java String...

javastringperformanceoptimizationmicro-optimization

Read More
Code alignment dramatically affects performance...

c++performanceassemblyx86micro-optimization

Read More
printing a signed integer optimization...

assemblyx86-64nasmmicro-optimization

Read More
High Performance Bit Removal (XOR vs. subtract)...

cxormicro-optimizationbranch-prediction

Read More
u8 as i32 vs i32::from_be_bytes([u8;4])...

rustmicro-optimization

Read More
Why do gcc and clang generate mov reg,-1...

assemblygccclangx86-64micro-optimization

Read More
Check if ax is divisible by 16...

assemblyoptimizationx86micro-optimization

Read More
Compact shellcode to print a 0-terminated string pointed-to by a register, given puts or printf at k...

linuxassemblyx86-64micro-optimizationshellcode

Read More
Count integers in [1..N] with K zero bits below the leading 1? (popcount for a contiguous range with...

optimizationx86nasmmicro-optimizationhammingweight

Read More
Bottleneck when using indexed addressing modes...

x86-64intelcpu-architecturemicro-optimizationaddressing-mode

Read More
Does optimizing an algorithm from O(2N) down to O(N) make it twice as fast?...

optimizationtime-complexitybig-omicro-optimizationspace-complexity

Read More
BackNext