Search code examples
nvprof option for bandwidth...


cudaprofilingnvprof

Read More
Roofline Model with CUDA Manual vs. Nsight Compute...


cudansightnvprofnsight-computeroofline

Read More
nvprof --metrics works with c++ executable but not with fortran executable...


c++cudafortrannvprof

Read More
How to observe CUDA events and metrics for a subsection of an executable (e.g. only during a kernel ...


cudaprofilingnvvpnvprof

Read More
nvprof is using all available GPU's when profiling python script...


pythoncudaprofilingnvprof

Read More
Profiling arbitrary CUDA applications...


linuxcudanvprofnvvp

Read More
Profilers (nvvp and nvprof) not showing "Page Fault" information...


windowscudanvprofnvvp

Read More
nv-nsight-cu-cli caused Tensorflow to fail...


tensorflowgpunvidianvprofnsight-compute

Read More
Meaning of the "flop_count_sp" and "inst_fp_32" metric in CUDA Profiler...


cudagpuprofilernvprofnvvp

Read More
NVIDIA Visual Profiler: Insufficient kernel bounds data...


cudanvprofnvvp

Read More
Why don't I get "thread_inst_executed"...


cudagpuprofilingnvidianvprof

Read More
dram_write_bytes result on P100...


cudanvprof

Read More
How are the blocks scheduled into the SMs in CUDA when their number is lesser than the available SMs...


cudaprofilinggpunvidianvprof

Read More
Running nvprof --metrics command under windows gives an error:cuda profiling error...


cudametricsnvprof

Read More
How to stop running TensorRT server without using ctrl-c (for profiling with nvprof)...


dockertensorrtnvidia-dockernvprofnvvp

Read More
nvprof warning on CUDA_VISIBLE_DEVICES...


pythoncudapytorchnvprof

Read More
No GPU activities in profiling with nvprof...


c++ccudanvidianvprof

Read More
nvprof not picking up any API calls or kernels...


ccudaprofilingnvprof

Read More
Why nvprof does not have metrics on floating point division operations?...


cudafloating-pointnvprof

Read More
Do the SM's shown in the "occupancy graph" correspond to `blockIdx.x` or register `%sm...


cudanvprof

Read More
Issued load/store instructions for replay...


cudanvidianvprof

Read More
How to profile OpenCL application with CUDA 8.0 nvprof...


cudaprofilingopenclnvprof

Read More
How to get malloc to show up in nvprof's statistical profiler?...


cudaprofilingnvprof

Read More
nvprof is crashing as it writes a very large file to /tmp/ and runs out of disk space...


cudanvprof

Read More
Where is the boundary of start and end of CPU launch and GPU launch of Nvidia Profiling NVPROF?...


cudagpuprofilingnvprofnvvp

Read More
local cache hit metric in cuda profiler...


cudanvprof

Read More
FLOP efficiency in CUDA...


cudanvprof

Read More
What is the difference between 'GPU activities' and 'API calls' in the results of &#...


c++cudanvprof

Read More
Unable to import nvprof generated profile data...


pythoncudatensorflownvprof

Read More
Why operations in two CUDA Streams are not overlapping?...


cudanvprofcuda-streamsnvvp

Read More
BackNext