I am trying to create a proof of concept showing that it is possible for a company to migrate their data from their oracle 12c DB to a hadoop system..
To do this I have started a Oracle Linux 7 instance on AWS. I am planning to install Oracle 12c on it.. after that I have to create dummy tables and send them to Hadoop..
can this be done? From my initial research I can see that Sqoop and Oracle GoldenGate can do that.. also if this goes live i would have to transfer billions of records from oracle 12c to HDFS..
Any help or advice is much much appreciated..
You must use sqoop for transform your oracle data to hive. You can read this for more info