How does one export a 50 GB column family from Cassandra to a CSV file? I have explored:
1.) Copy ... To - this results in an rpc_timeout
2.) Have cassandra output the results of a query directly to a text a file - results in rpc_timeout
I have manipulated the read and write request timeout values (restarting dse everytime) in the cassandra.yaml file with no success. How do you export this data to CSV?
Try Brian Hess's cassandra unloader. Its probably the most user friendly and performant tool out there.
It might become part of the core project.
https://issues.apache.org/jira/plugins/servlet/mobile#issue/CASSANDRA-9048