Search code examples
Will a modern processor (like the i7) follow pointers and prefetch their data while iterating over a...


c++performancecachingpointerscpu-cache

Read More
Binary search with looking nearby values...


performancecontainersbinary-searchcpu-cache

Read More
Is there a cache in the ARM Cortex-M4?...


cachingarmmicrocontrollercpu-cachecortex-m

Read More
How much of ‘What Every Programmer Should Know About Memory’ is still valid?...


optimizationmemoryx86cpu-architecturecpu-cache

Read More
Why does Intel use a VIPT cache and not VIVT or PIPT?...


cachingmemoryintelcpu-architecturecpu-cache

Read More
Why is a linear search through a heap data structure faster than tree traversal?...


javaperformanceheapcpu-cache

Read More
Is CUDA shared memory also cached...


cudagpucpu-cachegpu-shared-memory

Read More
What's the point of cache clean _and_ invalidate in ARM Cortex processors?...


cachingembeddedcpu-cachecortex-m

Read More
Does Intel Cache Allocation Technology allow hits from CPUs in one group on cache lines in another g...


cpucpu-architectureintelcpu-cache

Read More
what is the purpose of using index caches in rigtorp's SPSCQueue...


queuecpu-architecturecpu-cachemicro-optimizationlock-free

Read More
Does the last level cache see the PC?...


cpucpu-architectureintelcpu-cacheprefetch

Read More
What is a "cache-friendly" code?...


c++performancecachingmemorycpu-cache

Read More
L1 Cache Usage in Optimised matrix multiplication micro-kernel in C++...


c++optimizationmatrix-multiplicationavxcpu-cache

Read More
How to programmatically get the CPU cache line size in C++?...


c++linuxwindowscpucpu-cache

Read More
C optimization: conditional store to avoid dirtying a cache line...


ccachingcpu-cachemicro-optimizationlibuv

Read More
What is the difference in logic and performance between LOCK XCHG and MOV+MFENCE?...


multithreadingassemblyconcurrencyx86cpu-cache

Read More
What's the theory and measurements behind cache line sizes?...


cpucpu-architecturecpu-cache

Read More
Explain this CPU Cache processor effect: Number of operations decreased exponentially but average ti...


c++cpucompiler-optimizationcpu-cache

Read More
reduce the cache misses by increasing size of array - why does this work?...


cachingcpu-architecturecpu-cache

Read More
CPU Registers and Cache Coherence...


multithreadingcpu-architecturelock-freecpu-cache

Read More
Why are most cache line sizes designed to be 64 byte instead of 32/128byte now?...


cpu-cache

Read More
Does the processor copy the same bloc from RAM to all caches?...


cachingcpucpu-architecturecpu-cache

Read More
Why do we even need cache coherence?...


ccpucpu-architecturecpu-cache

Read More
Usermode CPU Data cache invalidation / flush on Linux (Cortex A53)...


linuxcachingarmcpu-cacheusermode

Read More
What does a cache line in a CPU consist of besides the usual tags, data, and dirty+valid bits?...


cachingcpuintelcpu-architecturecpu-cache

Read More
Iteration over 2d matrix generates more cache references in perf after switching order of indices...


linuxx86-64cpu-cacheperfamd-processor

Read More
Are Lisp lists always implemented as linked lists under the hood?...


linked-listlispcpu-cache

Read More
Analysing performance of transpose function...


openmphpccpu-cachemicrobenchmarkmemory-bandwidth

Read More
Are there any modern CPUs where a cached byte store is actually slower than a word store?...


performancex86armcpu-architecturecpu-cache

Read More
Can all of L2/L3 cache be used by data? If so, why does the Graviton 3 bandwidth plot drop off after...


performancecpu-architecturearm64cpu-cachememory-bandwidth

Read More
BackNext