Search code examples
Run time and reported cycle counts in linux perf...

performance, x86-64, cpu-architecture, matrix-multiplication, perf

Which alignment causes this performance difference...

performance, x86, cpu-architecture, memory-alignment, perf

Using "perf stat" to profile both process and system-wide events simultaneously...

linux, perf

What is `__GI_memset`? Why does it cost so much CPU resource?...

libc, perf, memset

Counting L3 cache access events on AMD Zen 2 processors...

linux, x86, cpu-cache, perf, amd-processor

AMD perf events...

x86, perf, amd-processor

Understanding the impact of lfence on a loop with two long dependency chains, for increasing lengths...

performance, assembly, x86, cpu-architecture, perf

Eclipse profiling with perf - PERF_FLAG_FD_CLOEXEC failed with unexpected error 13 (permission denie...

eclipse, profiling, perf

Could I load an ELF and get virtual addresses from /proc/pid/maps before actually running it?...

c, linux, binary, memory-address, perf

Access behavior of MM_PREFETCH intrinsic - Intel...

x86, intel, cpu-architecture, perf, prefetch

How to access RAPL via perf with Rocket Lake?...

x86, intel, perf, amd-processor, energy

Perf instruction/cycles count in userspace/kernelspace alone in Linux...

linux, linux-kernel, profiling, performancecounter, perf

Perf report's function at 0xffffffffffffffff...

performance, linux-kernel, perf

Why is moving data between two CPU registers so slow that it costs 30% of total time?...

c++, performance, g++, perf, mov

Redirect perf output...

python, perf

perf_event_open always returns -1...

linux, arm, system-calls, perf

Understanding Linux perf FP counters and computation of FLOPS in a C++ program...

c++, linux, profiling, perf, flops

Performance Counters and IMC Counter Not Matching...

linux-kernel, performancecounter, perf, memory-access, intel-pmu

Perf dynamic tracing Failed to add events...

java, linux, jvm, perf

Understanding the Frequency printed by Linux Perf Stat...

linux, performancecounter, perf, cpu-cycles

How can you get frame-pointer perf call stacks/flamegraphs involving the C++ standard library?...

c++, docker, perf, libstdc++, flamegraph

Thread utilization profiling on Linux...

linux, multithreading, performance, profiling, perf

What is the default behavior of perf record?...

linux, perf

perf record with --call-stack fp fails to unwind main function...

c++, glibc, libc, perf, stack-unwinding

Required compile flags in order to use perf...

c++, linux, profiling, compiler-optimization, perf

How does "perf sched record" count context switches?...

linux, performance, scheduler, perf, context-switch

Frequency-based sampling of multiple threads with perf record...

multithreading, profiling, perf

How do I sample all threads and record their thread id with perf?...

perf

Is there a way to profile an MPI program with detailed cache/CPU efficiency information?...

c++, mpi, perf, gperf

Using perf to monitor raw event counters...

linux, profiling, intel, perf
