Please guide me on how to actually load twitter data in Apache hadoop and analyse it. I have heard it is done by using twitter API keys, but can anybody help figure out the steps
Check this github project for analyzing tweets in hadoop.
https://github.com/cloudera/cdh-twitter-example
This page also includes how to setup flume, hive & oozie.
Installing hadoop, flume, oozie, hive: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/CDH4-Installation-Guide/CDH4-Installation-Guide.html