Search code examples
cudanvidiansight

CUDA - more SM or higher clock rate?


What counts more when CUDA kernel speed execution is of vital importance? The frequency of the cores or the number of the SMs?

I can choose between a Quadro K5000 and a Gtx 670 and I cannot decide. Memory seems enough in both cases but the quadro has more SMs while the Gtx has a higher clock rate (I suppose this value is per-core).


Solution

  • Depends in what you are trying to execute. Will your program make use of all the cores of the Quadro? If not, the Gtx will be faster. If it does and the Gtx would need more than 1 grid, you should do the math, but probably the Quadro will be faster.