Search code examples
pythoninstallationanacondacondaspark-koalas

How to install koalas with conda?


According to the koalas docs, installing koalas with conda should be as easy as

conda install koalas

But this raises the error:

Collecting package metadata (repodata.json): done Solving environment: failed with initial frozen solve. Retrying with flexible solve.

PackagesNotFoundError: The following packages are not available from current channels:

  • koalas

Current channels:

To search for alternate channels that may provide the conda package you're looking for, navigate to

https://anaconda.org

and use the search bar at the top of the page.

I tried running

conda install -c conda-forge pyspark

first too, and that seemd to work out just fine. But koalas still raises the same error. Is there something more to

Koalas requires PySpark so please make sure your PySpark is available.

than conda install -c conda-forge pyspark that is too obvious to include in the docs?

I've also tried the whole thing in a virtual environment with

conda create --name koalas-dev-env

if that should matter.

System info:

Python 3.7.0
Anaconda 2019.10

Solution

  • Running

    conda create -c conda-forge -n koalas-dev-env koalas pyspark

    will do. You run into issue since you have not provided the conda-forge channel. The latter you can also add to the .condarc config file.

    [EDIT]

    As pointed out by the OP, this should have been made clear in the docs. I made a pull request to fix that.