I'm new to Nsight and CUDA. I tried to set a breakpoint on a specific line inside my CUDA kernel, but it doesn't stay there: the breakpoint ends up at the end of the kernel instead of on the line I want to debug.
I am currently running the program through Menu->Nsight->Start CUDA Debugging. When I set a breakpoint in a different project (the one marked as the "StartUp project"), it works. Any suggestions about how I can get the breakpoint to bind to a particular line rather than to the entire kernel?
I used too many threads per block (256x256) to launch my kernel: dim3 threads(256,256), then kernel<<<...,threads>>>
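
For context, a 256x256 block is 65,536 threads, well above the per-block limit (1024 on most current GPUs, lower on some older ones). A launch with that configuration fails before any kernel code runs, which would be consistent with a breakpoint never being hit on the intended line. Below is a minimal sketch of one way to cover the same 256x256 range and surface launch errors. The kernel body, the `d_out` buffer, and the 16x16 block size are hypothetical stand-ins, not the original code.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical stand-in for the kernel being debugged.
__global__ void kernel(int *out)
{
    int x = blockIdx.x * blockDim.x + threadIdx.x;
    int y = blockIdx.y * blockDim.y + threadIdx.y;
    int width = gridDim.x * blockDim.x;
    out[y * width + x] = x + y;   // a convenient line for a breakpoint
}

int main()
{
    // Query the real per-block limit instead of assuming it.
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);
    printf("maxThreadsPerBlock = %d\n", prop.maxThreadsPerBlock);

    // Assumed split: 16x16 threads per block, 16x16 blocks,
    // which still gives 256x256 threads in total.
    dim3 threads(16, 16);
    dim3 blocks(16, 16);

    int *d_out = nullptr;
    cudaMalloc(&d_out, 256 * 256 * sizeof(int));

    kernel<<<blocks, threads>>>(d_out);

    // Reports launch-time problems such as an invalid configuration.
    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess)
        printf("launch failed: %s\n", cudaGetErrorString(err));

    // Reports errors raised while the kernel was running.
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess)
        printf("kernel failed: %s\n", cudaGetErrorString(err));

    cudaFree(d_out);
    return 0;
}
```

If `threads` is changed back to `dim3 threads(256, 256)`, the `cudaGetLastError()` check should report an invalid configuration, which makes this kind of problem visible without relying on the debugger.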