HBase: do I need jobtracker/tasktracker

Go To StackoverFlow.com


If I don't do any map/reduce jobs, still JobTracker/TaskTrackers need to be running for some HBase internal dependency?

2012-04-04 07:25
by kee


No you don't need both for running solely HBase.

Just a tip: there are always scripts that just start the HDFS, bin/start-dfs.sh for example.

2012-04-04 07:39
by Thomas Jungblut


As mentioned above we don't need Job/Tasktracker if we are dealing with just Hbase. You can use bin/start-dfs.sh to start Name/Dtanodes..Moreover bin/start-all.sh has been deprecated now..So you should prefer using bin/start-dfs.sh to start Name/Datanodes and bin/start-mapred.sh to start Job/Tasktracker..I would suggest using Hbase in pseudo-distributed mode for learning and testing purpose, as in standalone Hbase doesn't use HDFS..You should be a bit careful while configuring though..

2012-04-07 20:53
by Tariq


Basic case: You don't need JobTracker and TaskTrackers when using only HDFS+HBase (in smaller, testing environment you don't need event HDFS)

When you would like to run MapReduce jobs using data stored in HBase, you'll obviously need both JobTracker and TaskTrackers.

2012-04-04 10:36
by wlk