I'm doing some statistics calculations. I need them to be fast, so I rewrote most of it to use SSE. I'm pretty much new to it, so I was wondering what the right approach here is:
To my knowledge, there is no log2 or ln function in SSE, at least not up to 4.1, which is the latest version supported by the hardware I use.
Is it better to:
There seem to be a few SSE log2
implementations around, e.g. this one.
There is also the Intel Approximate Maths Library which has a log2
function among others - it's old (2000) but it's SSE2 and it should still work reasonably well.