I am uploading data from .csv files into Cassandra cluster deployed in Amazon EC2 using COPY command from EBS volume attached to it. I noticed that the Cassandra upload time increases very badly, with increase in size of the .csv file.
Is there a way in which I can tune the settings to increase the load rate for cassandra.. ?
Use a real bulk loader. COPY is not appropriate for millions of records.