Search code examples
cassandradatastaxdatastax-enterprise

DSE Loading Data using Bulk Loader


Currently, I have successfully installed the necessary nodes and datacenters through the usage of the OpsCenter.

I have also generated the necessary table and Keyspace using Cassandra through DataStax Studio

KeySpace Generated

CREATE KEYSPACE graph_tables WITH REPLICATION = {'class':'SimpleStrategy', 'replication_factor':1};

Table Generated

CREATE TABLE people_node (id text, name text, age int, location 'PointType', gender text, dob timestamp, PRIMARY KEY(id));

Sample Data

id, name , age, location, gender, dob
0,  Betsy, 15 , 10 15   , F     , 1997-09-21T12:55:54

Assuming we have a node_1 with the IP Address 1.1.1.1 and second node called node_2 with the IP Address 2.2.2.2. These will be the two nodes that the OpsCenter have installed Cassandra on

From here I attempted to insert the necessary data using dsbulk

dsbulk load -url ./people_node_csv -k graph_tables -t people_node -h '1.1.1.1, 2.2.2.2 ' -header true

However, this results in an error stating "Operation Load_..... failed: Authentication error on host /1.1.1.1:9042: Host /1.1.1.1:9042 requires authentication, but no authenticator found in Cluster Configurations". I attempted to resolve this by adding in "driver.ssl.keystone.password = cassandra" as shown in the Document. But the error still persist. Any advise on solving this issue will be greatly appreciated.


Solution

  • You need to provide following settings as described in documentation:

    • -u - to specify user name
    • -p - to specify password
    • --driver.auth.provider DsePlainTextAuthProvider - to select corresponding authentication provider.