Search code examples
cudanvcc

How do we use cuPrintf()?


What do we have to do to use cuPrintf()? (device compute capability 1.2, Ubuntu 12) I couldn't find "cuPrintf.cu" and "cudaPrintf.cuh", so i downloaded their code and include them:

#include "cuPrintf.cuh"
#include "cuPrintf.cu"

By the way this is the rest of the code:

__global__ void hello_kernel (float f) {
printf ("Thread number %d. f = %d\n", threadIdx.x, f);
}

    int main () {
    dim3 gridSize = dim3 (1);
    dim3 blockSize = dim3 (16);
    cudaPrintfInit ();
    hello_kernel <<< gridSize, blockSize >>> (1.2345f);
    cudaPrintfDisplay (stdout, true);
    cudaPrintfEnd ();
    return (0);
}

But nvcc still gives a mistake:

max@max-Lenovo-G560:~/CUDA/matrixMult$ nvcc printfTest.cu -o printfTest

printfTest.cu(5): error: calling a __host__ function("printf") from a __global__
function("hello_kernel") is not allowed

Thanks!


Solution

  • In your kernel instead of this:

    printf ("Thread number %d. f = %d\n", threadIdx.x, f);
    

    you should do this:

    cuPrintf ("Thread number %d. f = %d\n", threadIdx.x, f);
    

    Other than that, I believe your code is correct (it works for me).

    This SO question/answer gives more tips about using cuPrintf properly.