Search code examples
hadoophiveclouderaamazon-emrimpala

How to install cloudera impala on EMR?


Is there anyway i can install the only impala without cloudera manager and without cdh. I will be using the apache version of hadoop?


Solution

  • Yes, it is absolutely possible. Add the repository into your sources.list file and update the repository after that.

    deb [arch=amd64] http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib deb-src http://archive.cloudera.com/impala/ubuntu/precise/amd64/impala precise-impala1 contrib

    After that, it's merely :

    sudo apt-get install impala                (Binaries for daemons)
    sudo apt-get install impala-server         (Service start/stop script)
    sudo apt-get install impala-state-store    (Service start/stop script)
    

    But do not forget to meet all the prerequisites. For a detailed info you can go here