Tags: jdbc, cassandra, thrift, spark-cassandra-connector, beeline

SQL query to spark-cassandra cluster thriftserver does not return


I have a Spark-Cassandra cluster set up, and cqlsh works just fine. I started the thriftserver and used the beeline command line to connect to it. I can run

use schemaname
show schemas
show tables

but a simple

select * from tablename 

gets stuck.

I also tried a Java client going through the jdbc:hive2 interface, and it shows the same behavior: good results on show tables, but no return on a simple select statement.

The Spark console shows all the jobs with no sign of complaint. The job is stuck in the Compiled stage.

Does anyone know how to diagnose this further?


Solution

  • The code itself was fine, and this problem is solved. The cause was that this setup does not go through YARN, and the first thriftserver instance took all the cores in the cluster. I was firing the SQL at a second thriftserver instance, which had 0 cores at its disposal, so the query never ran.
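
To make the core starvation concrete, here is a minimal sketch of how cores end up distributed when applications are served in start order. It is a simplified model, not Spark's scheduler code: the class name, the grant method, the UNLIMITED marker, and the 8-core cluster are all hypothetical. It assumes a standalone master, where an application that sets no cap (for example via spark.cores.max or --total-executor-cores) may take every free core.

```java
import java.util.Arrays;

// Simplified model of first-come core granting on a standalone cluster.
// NOT Spark's scheduler code; all names here are hypothetical.
final class CoreAllocationSketch {

    // Marker for an application that sets no core cap.
    static final int UNLIMITED = -1;

    // Grants cores to applications in start order. Each application takes
    // min(its cap, cores still free); an uncapped one takes everything left.
    static int[] grant(int totalCores, int[] requestedMax) {
        int free = totalCores;
        int[] granted = new int[requestedMax.length];
        for (int i = 0; i < requestedMax.length; i++) {
            int want = requestedMax[i] == UNLIMITED
                    ? free
                    : Math.min(requestedMax[i], free);
            granted[i] = want;
            free -= want;
        }
        return granted;
    }

    public static void main(String[] args) {
        // Two uncapped thriftserver instances on an assumed 8-core cluster:
        // the second one is left with nothing.
        System.out.println(Arrays.toString(grant(8, new int[] {UNLIMITED, UNLIMITED})));
        // Capping each instance at 4 cores leaves room for both.
        System.out.println(Arrays.toString(grant(8, new int[] {4, 4})));
    }
}
```

In this model the first call yields [8, 0] and the second [4, 4], which matches the symptom above: the second instance accepts the connection and answers metadata commands, but has no cores to execute a select. Capping the first instance's cores, or running only one instance, should avoid it.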