Search code examples
rdistributed-computing

What are the cons and pros between SparkR and Revolution R?


There are 2 ways to run the R parallelly, which are SparkR and RevolutionR.

From the compatibility with R,distribution processing effect, scalability, application scenarios, community support and maybe some other aspects, what are the real difference between these two?


Solution

  • As I known, the biggest difference is that Revolution R is a commercial software while SparkR is free. So, you even can't try most of the parallel functions of Revolution R before paid.

    Wait for other guys w/ real experience of Revolution R to update the answer :)

    Btw, there are not only 2 ways to run R parallel. Other approaches, such as snow, multicore, parallel, Rmpi, ... And offload to GPU as here.


    Check out below two links:

    RevolutionR

    SparkR