Search code examples
apache-sparkamazon-emrsparkrapache-zeppelin

Running R on amazon EMR with spark 1.6 and Zeppelin 0.5.6


I am trying to setup the R interpreter to run in Zeppelin which is currently running on EMR. Zeppelin is working perfectly and I am able to write script in Scala and Python. When I use %r, %sparkR or %knitr I receive an error : "r interpreter not found"

The applications which I have running in my emr-4.7.2 cluster are: Hive 1.0.0, Zeppelin-Sandbox 0.5.6, Spark 1.6.2, Pig 0.14.0

Within the interpreter there is no mention of R so figure I am missing something but do not know what.

Any pointers greatly appreciated.


Solution

  • Zeppelin on Amazon EMR (till at least emr-5.0.0) does not support the SparkR interpreter.

    You ought following the Elastic Map Reduce Release Guide/Zeppelin documentation to get more information.