Search code examples
What do the letters in port usage on uops.info mean?...


x86cpucpu-architectureintel

Read More
Determine target ISA extensions of binary file in Linux (library or executable)...


linuxshared-librariesexecutablecpu-architectureinstruction-set

Read More
Do assembly instructions map 1-1 to machine language?...


assemblycpu-architecturemachine-code

Read More
Slowing down CPU Frequency by imposing memory stress...


c++linuxcpuintelcpu-architecture

Read More
If cache invalidation happens every time memory mappings change, why not opt for VIVT?...


cachingx86cpucpu-architecturecpu-cache

Read More
Can addition be done in less than a cycle when outputs depend on each other?...


assemblyx86cpu-architectureintel

Read More
How do modern Intel x86 CPUs implement the total order over stores...


x86intelcpu-architecturememory-barriersmesi

Read More
Understanding synchronization with multiple processors...


javamultithreadingcpu-architectureatomiccompare-and-swap

Read More
Difference between low and high 8-bit registers; do their values use bits in opposite bit-endian ord...


x86cpu-architecturecpu-registers

Read More
How to start learning assembly language on any system...


assemblycpu-architectureportabilityplatform-independent

Read More
Was there any advantage to the 386 architecture making 16-bit register arithmetic leave upper bits u...


assemblyx86cpu-architecturehardwarecpu-registers

Read More
Was there a P4 model with double-pumped 64-bit operations?...


x86x86-64intelcpu-architecture

Read More
Atomicity of loads and stores on x86...


c++x86cpu-architectureatomicmemory-barriers

Read More
optimal to flush low-contention atomic from caches?...


multithreadingcpu-architectureatomiccpu-cachemesi

Read More
CPU operations during g++ compiling...


compilationg++cpucpu-architecturebuild-server

Read More
Are programs compiled for RV32E guaranteed to produce equivalent results on RV32I machines?...


assemblycpu-architecturecpu-registersriscv

Read More
Why does floating-point output differ across platforms?...


javajdbcfloating-pointcpu-architectureieee-754

Read More
How do I force the CPU to perform in order execution of a program without any loops or branches?...


gccx86cpucpu-architecture

Read More
What's the purpose of the rotate instructions (ROL, RCL on x86)?...


assemblyx86cpu-architecturebit-shiftinstruction-set

Read More
Does INVLPG instruction or mprotect() affect the CPU cache state while invalidating TLB entries?...


assemblyx86cpu-architecturecpu-cachetlb

Read More
AVX2 / gcc: Improve CPU-level parallelism by using different registers...


gccvectorizationcpu-architecturesimdavx2

Read More
How much of ‘What Every Programmer Should Know About Memory’ is still valid?...


optimizationmemoryx86cpu-architecturecpu-cache

Read More
What does memory_order_consume really do?...


c++cpu-architecturelock-freememory-modelstdatomic

Read More
How does a pipelined processor guarantee instruction atomicity so they don't conflict, and so in...


cpu-architectureatomicprocessor

Read More
Handling Precise Exceptions in Tomasulo...


exceptionmipscpu-architecture

Read More
How do machines interpret binary?...


binaryoperating-systemcpucpu-architecture

Read More
Why is a CPU branch instruction slow?...


optimizationlanguage-agnosticcpucpu-architecturebranch-prediction

Read More
Why is processing a sorted array faster than processing an unsorted array?...


javac++performancecpu-architecturebranch-prediction

Read More
Why is Skylake so much better than Broadwell-E for single-threaded memory throughput?...


performancex86cpu-architecturebenchmarkingmemory-bandwidth

Read More
Why is acquire semantics only for reads, not writes? How can an LL/SC acquire CAS take a lock withou...


assemblycpu-architecturestdatomiccompare-and-swapspinlock

Read More
BackNext