This is my scenario: I develop my CUDA application on a Windows machine, then compile and run it on a remote Linux (Debian) server without graphical output, which I access through PuTTY.
What is the best way to debug and profile my application? I have read about NVIDIA's Parallel Nsight and Parallel Nsight Monitor. Is this the (only) way?
With Nsight Eclipse Edition 6.0, running, debugging, and profiling a CUDA application on a remote machine has become comfortable and easy; each task works with one click. Remote debugging requires CUDA SDK version 5.5 on the remote machine, and remote profiling requires version 6.0.
Details can be found in the Nsight Eclipse Edition documentation.