python, hadoop, apache-spark, spark-graphx

How to run GraphX on IPython Notebook?


I'm trying to run GraphX in an IPython notebook. First, I launched the Spark/Hadoop cluster, and then I launched the IPython notebook by following this tutorial (http://blog.insightdatalabs.com/jupyter-on-apache-spark-step-by-step/). But now I only have a Python 2 kernel, and I need GraphX to work with graphs. How can I run it?


Solution

  • GraphX is only exposed through the Scala API; there is no Python binding for it. That leaves two options: use a Scala kernel such as Jupyter-Scala so that you can run Scala inside Jupyter, or use GraphFrames, which supports the Python API. For more information on GraphFrames, see Introducing GraphFrames.
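
For the GraphFrames route, here is a minimal sketch of what a notebook cell might look like. It assumes Spark 2.x or later (for `SparkSession`) and that the graphframes package is already available to the Spark session, for example by starting PySpark with a `--packages` coordinate that matches your Spark and Scala versions. The vertex and edge data, the column name `relationship`, and the helper names `build_graph` and `demo` are made up for illustration.

```python
# Toy data, invented for this sketch.
VERTICES = [("a", "Alice"), ("b", "Bob"), ("c", "Carol")]
EDGES = [("a", "b", "follows"), ("b", "c", "follows"), ("c", "a", "follows")]


def build_graph(spark):
    """Build a GraphFrame from the toy data using an existing SparkSession."""
    # Imported lazily so the cell still loads where graphframes is missing.
    from graphframes import GraphFrame

    # GraphFrames expects an "id" column on the vertices DataFrame
    # and "src" / "dst" columns on the edges DataFrame.
    vertices = spark.createDataFrame(VERTICES, ["id", "name"])
    edges = spark.createDataFrame(EDGES, ["src", "dst", "relationship"])
    return GraphFrame(vertices, edges)


def demo():
    """Run a couple of simple graph queries (requires pyspark + graphframes)."""
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("graphframes-sketch").getOrCreate()
    g = build_graph(spark)

    # In-degree of each vertex, returned as a DataFrame.
    g.inDegrees.show()

    # PageRank with a fixed number of iterations; results are on .vertices.
    ranked = g.pageRank(resetProbability=0.15, maxIter=5)
    ranked.vertices.select("id", "pagerank").show()

    spark.stop()
```

Calling `demo()` from a Python kernel should print the two result tables. If the notebook already provides a `spark` session, passing it to `build_graph(spark)` directly avoids creating a second one.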