Search code examples
amazon-web-servicesamazon-ec2cassandracql3cqlsh

increase upload efficiency for cassandra


I am uploading data from .csv files into Cassandra cluster deployed in Amazon EC2 using COPY command from EBS volume attached to it. I noticed that the Cassandra upload time increases very badly, with increase in size of the .csv file.

Is there a way in which I can tune the settings to increase the load rate for cassandra.. ?


Solution

  • Use a real bulk loader. COPY is not appropriate for millions of records.