I want to make calls to cuBLAS routines asynchronously. Is it possible? If yes, how can I achieve that?
Use the cublasSetStream function before the cublas calls.
cublasSetStream
cublasSetStream(cublasHandle, cudaStream);
cublasSetStream(cublasHandle, cudaStream)