hadoop, twitter, twitter4j

Fetch Twitter data in Hadoop


Please guide me on how to load Twitter data into Apache Hadoop and analyse it. I have heard this is done using Twitter API keys, but can anybody help me figure out the steps?


Solution

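  • See this GitHub project for a worked example of analyzing tweets in Hadoop:

    https://github.com/cloudera/cdh-twitter-example

    The project page also explains how to set up Flume, Hive, and Oozie. In that setup, Flume uses your Twitter API keys to stream tweets into HDFS, and Hive is then used to query them.

    To install Hadoop, Flume, Oozie, and Hive, follow the CDH4 installation guide: http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/CDH4-Installation-Guide/CDH4-Installation-Guide.html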
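
To give a rough idea of what the "analyse" step can look like once tweets are stored, here is a minimal plain-Java sketch that counts hashtags across a handful of tweet texts. It is not taken from the linked project, which relies on Hive queries for analysis. The class name `HashtagCount`, the method `countHashtags`, and the sample tweets are all hypothetical, and the code assumes the tweet text has already been extracted from the raw JSON. It uses only the standard library, so it shows the aggregation logic rather than how to run a job on a cluster.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only: a local, single-process hashtag count.
public class HashtagCount {

    // A '#' followed by one or more word characters (letters, digits, underscore).
    private static final Pattern HASHTAG = Pattern.compile("#\\w+");

    // Extracts every hashtag from each tweet text (the "map" part) and
    // sums the occurrences per lower-cased hashtag (the "reduce" part).
    public static Map<String, Integer> countHashtags(List<String> tweets) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String text : tweets) {
            Matcher m = HASHTAG.matcher(text);
            while (m.find()) {
                String tag = m.group().toLowerCase(Locale.ROOT);
                counts.merge(tag, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Assumed sample data; real input would come from the files stored in HDFS.
        List<String> sample = Arrays.asList(
            "Learning #Hadoop and #Flume today",
            "Loading tweets into #hadoop with #Hive");

        // Prints one "hashtag<TAB>count" line per distinct hashtag.
        countHashtags(sample).forEach((tag, n) -> System.out.println(tag + "\t" + n));
    }
}
```

Because hashtags are lower-cased before counting, `#Hadoop` and `#hadoop` in the sample are treated as the same tag. On a real cluster the same grouping would more likely be expressed as a Hive query or a MapReduce job over the stored tweets.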