I am using NSight to debug my CUDA code and I have question: how can I place a breakpoint for a specific thread and block?
When I place a breakpoint on the kernel the debugger always stops at thread 0 of block 0.
try to use CUDA Debug Focus. you can debug any thread in any block you want...